Abstract

In this paper, we prove results on enumerations of sets of Rota-Baxter words
in a finite number of generators and a finite number of unary operators.
Rota-Baxter words are words formed by concatenating generators and images of
words under Rota-Baxter operators. Under suitable conditions, they form
canonical bases of free Rota-Baxter algebras and have recently been studied in
relation to combinatorics, number theory, renormalization in quantum field
theory, and operads. Enumeration of a basis is often a first step to choosing
a data representation in implementation. Our method applies some simple ideas
from formal languages and compositions (ordered partitions) of an integer. We
first settle the case of one generator and one operator where both have
exponent 1 (the idempotent case). Some integer sequences related to these sets
of Rota-Baxter words are known and connected to other combinatorial sequences,
such as the Catalan numbers, and others are new. The recurrences satisfied by
the generating series of these sequences prompt us to discover an efficient
algorithm to enumerate the canonical basis of certain free Rota-Baxter
algebras. More general sets of Rota-Baxter words are enumerated with summation
techniques related to compositions of integers.
1 Introduction



This paper studies enumeration and algorithms related to generation of sets of 
Rota-Baxter words which occur naturally as canonical bases of certain Rota- 
Baxter algebras. 

In the 1950s, Spitzer proved a fundamental identity on fluctuation theory in
probability by analytic methods. The field of Rota-Baxter algebra was started
after G. Baxter (1960) showed that Spitzer's identity follows more generally by
a purely algebraic argument for any linear operator $P$ on an algebra satisfying
the simple identity

\[ P(x)P(y) = P\bigl(xP(y) + P(x)y + \lambda xy\bigr) \tag{1} \]

for all elements $x, y$ of the algebra, where $\lambda$ is a constant.¹ Rota studied
this operator through his many articles and communications (see Rota (1995),
for example). In recognition of the great contribution of Rota and Baxter,
such a linear operator $P$ is called a Rota-Baxter operator and an algebra
with the operator is called a Rota-Baxter algebra. In spite of diverse
applications of Rota-Baxter algebras in mathematics and physics, the study
of Rota-Baxter algebra itself has been highly combinatorial. Rota (1969) and
Rota and Smith (1972), for instance, related Rota-Baxter operators to other
combinatorial identities, such as the Waring formula and the Bohnenblust-Spitzer
identity. Explicit constructions of free commutative Rota-Baxter algebras have
played an important role in further studies, from Cartier (1972) and Rota (1969)
in the 1970s, to Guo and Keigher (2000a, 2000b) in the 1990s. Because of the
combinatorial nature of the constructions, the related enumeration problems
are interesting to study. For example, Guo (2005) showed that free commutative
Rota-Baxter algebras on the empty set are related to Stirling numbers
of the first and second kind, and these results in general provide generating
series for other number sequences.



The unexpected application of non-commutative Rota-Baxter algebras in the
work of Connes and Kreimer (2000, 2001) and Ebrahimi-Fard, Guo and Kreimer
(2004, 2005) on renormalization of quantum field theory moves the constructions
of the corresponding free objects to the forefront. Such constructions
were obtained recently by Ebrahimi-Fard and Guo (2004, 2005), providing a
fuller understanding of the connection first made by Aguiar (see Aguiar (2000)
and Ebrahimi-Fard (2002)) between Rota-Baxter algebras and dendriform
algebras of Loday and Ronco (2001), and in particular their Hopf algebra of
planar trees. Free nonunitary Rota-Baxter algebras are convenient in the study
of the adjoint functor from dendriform algebras to Rota-Baxter algebras.

¹ We will take $\lambda = -1$ in this paper. Some authors use $-\theta$ in place of $\lambda$.



We consider enumeration and algorithmic generation of sets of strings called
Rota-Baxter words (RBWs) which represent expressions in generators and unary
operators. Under suitable conditions, these sets form the canonical bases for
(non-unitary) free Rota-Baxter algebras with finitely many operators and
finitely many generators. These constructions of free Rota-Baxter algebras
have only recently been explored in special cases in Ebrahimi-Fard and Guo
(2004, 2005) and we enumerate not only their canonical bases but also sets that
may be useful for more general yet-to-be-explored constructions. For this
purpose, we apply concepts and methods from formal languages, grammars,
compositions of integers, and generating functions. While most of the sequences
and double sequences are found to be closely related to the Catalan numbers, we
found sequences that are not covered in the Sloane database. The enumeration
study leads us to an algorithm that generates canonical bases for certain free
Rota-Baxter algebras. These algorithmic explorations of enumeration methods
(either sequential or randomized) are the first steps that must precede
development of important software tools in symbolic computation packages
that allow further investigations of algebraic properties of related algebras
based on Rota-Baxter words. In addition to Rota-Baxter word sets in several
free operators and free generators, we consider sets of RBWs where the
number of consecutive applications of operators and the number of consecutive
generators are bounded. These RBWs contain interesting combinatorial
structures related to planar rooted trees which form Rota-Baxter algebras as
developed in Ebrahimi-Fard and Guo (2005). Some of these properties in certain
cases are studied by Aguiar and Moreira (2005). Specifically, Aguiar and
Moreira used bijections between combinatorial objects while we applied direct
algorithmic enumerations.



The rest of the paper is organized into sections dealing with three levels of
generality, with each level built upon the previous one. Section 2 briefly
reviews concepts on a free Rota-Baxter algebra and its canonical basis consisting
of Rota-Baxter words. Section 3 deals with the case of one idempotent operator
and one idempotent generator (that is, the exponent 1 case). In Section 4,
we generalize this to arbitrary exponents for one operator and one generator.
In Section 5, we further generalize this to an arbitrary number of operators and
generators, with arbitrary uniform exponent relations. We end with a brief
remark on future research.



2 Background and notations 



Let k be a commutative unitary ring and let $A$ be a non-unitary k-algebra.
A non-unitary free Rota-Baxter algebra over $A$ is a Rota-Baxter algebra
$(F(A), P_A)$ together with a k-algebra homomorphism $j_A : A \to F(A)$ with the
property that, for any Rota-Baxter algebra $(R, P)$ together with a k-algebra
homomorphism $f : A \to R$, there is a unique homomorphism $\tilde{f} : F(A) \to R$
of Rota-Baxter algebras such that $f = \tilde{f} \circ j_A$.

Ebrahimi-Fard and Guo (2004, 2005) explicitly constructed a free, non-commutative,
Rota-Baxter algebra over $A$, denoted by $\text{Ш}^{NC,0}(A)$, if a k-basis
$X$ of the k-algebra $A$ exists and is given. Since the enumeration of a k-basis
of $\text{Ш}^{NC,0}(A)$ is the main subject of study of this paper, we briefly recall its
construction. In what follows, the product of $x_1, x_2 \in X$ in the algebra $A$ is
denoted by $x_1x_2$ or by $x_1 \cdot x_2$ if clarity is needed, and the repeated $n$-fold
product of $x \in X$ in the algebra $A$ is denoted as usual by $x^n$.

Let $\lfloor$ and $\rfloor$ be symbols, called brackets, and let $X' = X \cup \{\lfloor, \rfloor\}$. Let $S(X')$
be the free (non-commutative) semigroup generated by $X'$. We can view an
element $u \in S(X')$ as a string made up of symbols from $X'$. The product of
two elements $u, v \in S(X')$ is, by an abuse of notation, also denoted by the
concatenation $uv$ whenever there is no confusion, and by explicitly using the
concatenation operator $\sqcup$ as in $u \sqcup v$ otherwise. It should be emphasized that
the operator $\sqcup$ is not a symbol in $X'$ and is used solely to
resolve ambiguity in cases when $u = x_1 \in X$ and $v = x_2 \in X$. We will adopt
the convention that the notation $uv = x_1x_2$ always means the product $x_1 \cdot x_2$ in
the algebra $A$ (and it may happen that $X$ is closed under algebra multiplication
so that $x_1x_2 \in X$), and the concatenation of $x_1$ with $x_2$ as elements of $S(X')$
will always be denoted by $x_1 \sqcup x_2$. As we shall see, concatenations of two (or
more) elements of $X$ are explicitly excluded in the RBW sets, in particular, in
any canonical basis of $\text{Ш}^{NC,0}(A)$. So such usage is very limited.
Definition 2.1 A Rota-Baxter word (RBW) $w$ is an element of $S(X')$ that
satisfies the following conditions.

(1) The number of $\lfloor$ in $w$ equals the number of $\rfloor$ in $w$;

(2) Counting from the left to the right, the cumulative number of $\lfloor$ at each
location is always greater than or equal to that of $\rfloor$;

(3) No subword $x_1 \sqcup x_2$ occurs in $w$, for any $x_1, x_2 \in X$;

(4) No subword $\rfloor\lfloor$ or $\lfloor\rfloor$ occurs in $w$.

Interpreting Definition 2.1, a Rota-Baxter word $w$ can be represented uniquely
by a finite string composed of one or more elements of $X$, separated (if more
than one $x$) by a left bracket $\lfloor$ or by a right bracket $\rfloor$, where the set of brackets
forms balanced pairs, but neither the string $\rfloor\lfloor$ nor the string $\lfloor\rfloor$ appears as
a substring. For example, when $X = \{x\}$, the word
$w = \lfloor\lfloor x\rfloor x\lfloor x\rfloor\rfloor x\lfloor x\rfloor$ is an RBW, but $\lfloor x \sqcup x\rfloor$, $\lfloor x^2\rfloor$,
$\lfloor x\rfloor\lfloor x\rfloor$, and $x\rfloor x\lfloor x$ are not. The number of
balanced pairs of brackets in an RBW is called its degree. The degree of $w$ in
the above example is 4.
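
As an illustration only, the four conditions of Definition 2.1 can be checked mechanically. The following Python sketch is not part of the construction recalled here. It assumes that the ASCII characters `[` and `]` stand for the two bracket symbols, that every generator is a single character, and that two adjacent generator characters represent a concatenation of two elements of X (which condition (3) forbids). The names `is_rbw` and `degree` are hypothetical.

```python
def is_rbw(word, generators=("x",)):
    """Check conditions (1)-(4) of Definition 2.1 for a string over X' = X + {[, ]}.

    Assumption: '[' and ']' play the role of the left and right bracket, and
    each entry of `generators` is a one-character element of X.
    """
    if not word:
        return False  # the empty word is not in the free semigroup S(X')
    depth = 0
    for pos, ch in enumerate(word):
        prev = word[pos - 1] if pos > 0 else None
        if ch == "[":
            if prev == "]":        # condition (4): "][" is forbidden
                return False
            depth += 1
        elif ch == "]":
            if prev == "[":        # condition (4): "[]" is forbidden
                return False
            depth -= 1
            if depth < 0:          # condition (2): never more ']' than '['
                return False
        elif ch in generators:
            if prev in generators:  # condition (3): no two adjacent generators
                return False
        else:
            return False           # symbol outside X'
    return depth == 0              # condition (1): brackets are balanced


def degree(word):
    """Number of balanced bracket pairs of an RBW."""
    return word.count("[")


if __name__ == "__main__":
    w = "[[x]x[x]]x[x]"
    print(is_rbw(w), degree(w))
    for bad in ("[xx]", "[x][x]", "x]x[x", "[]"):
        print(bad, is_rbw(bad))
```

Note that nested brackets such as `[[x]]` pass this check: nesting is excluded only later, under the idempotency hypothesis of Section 3.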

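Let $\mathfrak{M}^0(X)$ be the set of Rota-Baxter words and let $\mathfrak{M}^1(X)$ be $\mathfrak{M}^0(X)$ with
the empty (or trivial) word adjoined.

Example 2.2 Let k be a field. Let $A = k[x]$ be the polynomial ring in one
indeterminate $x$ over k. Then $X = \{x^n \mid n \in \mathbb{N}\}$ is a k-basis. In this case, if
$a + b = n$, then $\lfloor x^n\rfloor = \lfloor x^a x^b\rfloor$ is an RBW, but $\lfloor x^a \sqcup x^b\rfloor$ is not.

Example 2.3 Let $B = k[x]/\mathfrak{a}$ be the quotient ring of $A$ of Example 2.2 by
the ideal $\mathfrak{a}$ generated by $x^2 - x$. Then writing $\bar{1} = 1 + \mathfrak{a}$ and $\bar{x} = x + \mathfrak{a}$, the set
$\bar{X} = \{\bar{1}, \bar{x}\}$ is a k-basis of $B$. Here $\lfloor\bar{1}\rfloor$, $\lfloor\bar{1} \cdot \bar{x}\rfloor$, and $\bar{1}\lfloor\bar{x}\rfloor$ are RBWs but
$\lfloor\bar{1} \sqcup \bar{x}\rfloor$ is not.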

Let $\text{Ш}^{NC,0}(A)$ be the free k-module with basis $\mathfrak{M}^0(X)$. Ebrahimi-Fard and
Guo (2004, 2005) show that the following properties

\[
\begin{aligned}
x \diamond x' &= x \cdot x', \\
x \diamond \lfloor w\rfloor &= x\lfloor w\rfloor, \\
\lfloor w\rfloor \diamond x &= \lfloor w\rfloor x, \\
\lfloor w\rfloor \diamond \lfloor w'\rfloor &= \lfloor \lfloor w\rfloor \diamond w'\rfloor + \lfloor w \diamond \lfloor w'\rfloor\rfloor + \lambda\lfloor w \diamond w'\rfloor
\end{aligned}
\tag{2}
\]

for all $x, x' \in X$ and all $w, w' \in \mathfrak{M}^0(X)$ uniquely define an associative bilinear
product $\diamond$ on $\text{Ш}^{NC,0}(A)$. This product, together with the linear operator

\[ P_A : \text{Ш}^{NC,0}(A) \to \text{Ш}^{NC,0}(A), \qquad P_A(w) = \lfloor w\rfloor \ \text{ if } w \in \mathfrak{M}^0(X), \tag{3} \]

and the natural embedding

\[ j_A : A \to \text{Ш}^{NC,0}(A), \]

makes $\text{Ш}^{NC,0}(A)$ the free (non-unitary) Rota-Baxter algebra over $A$. We will
not need to know the explicit construction of this associative bilinear product
in $\text{Ш}^{NC,0}(A)$ for the rest of the paper. However, we note that as an element of
the algebra $\text{Ш}^{NC,0}(A)$, the string $\lfloor w\rfloor$ may be interpreted as the image of the
operator $P_A$ on $w$ for any $w \in \mathfrak{M}^0(X)$ and that for any such $w$ writable as
the concatenation $uv$ for $u, v \in \mathfrak{M}^0(X)$, the concatenation can be viewed as
$u \diamond v$ (the first three cases of Eq. (2)). This justifies the abuse of notation
and convention in using concatenation for both the algebra multiplication in
$A$ and the semigroup product in $S(X')$.

Example 2.4 Let k be a field. Let $A = k\langle x_1, \ldots, x_q\rangle$ be the polynomial ring
in $q$ non-commuting indeterminates $x_1, \ldots, x_q$ over k. Then the set $X$ of
(non-commutative) monomials is a k-basis of $A$. If $q \ge 2$, then for any exponents
$a, b \ge 1$, $\lfloor x_1^a x_2^b x_1^a\rfloor = \lfloor x_1^a \cdot x_2^b \cdot x_1^a\rfloor$ is an RBW,
but $\lfloor x_1^a \sqcup x_2^b \sqcup x_1^a\rfloor$ is not. Moreover, in $\text{Ш}^{NC,0}(A)$, we
have $1 \diamond x_i = x_i = x_i \diamond 1$ and $1 \diamond \lfloor w\rfloor = 1\lfloor w\rfloor$ for any $w \in S(X')$.

Definition 2.5 The free (non-unitary) Rota-Baxter algebra $\text{Ш}^{NC,0}(A)$ of
Example 2.4 will be denoted by $\text{Ш}^{NC,0}(q)$ and referred to as the free
(non-unitary) Rota-Baxter algebra on $q$ generators $x_1, \ldots, x_q$. The corresponding
k-basis $\mathfrak{M}^0(A)$ consisting of RBWs built from $X$ will be denoted by $\mathfrak{M}^0(q)$.
Any non-commutative monomial $x \in X$ will be called an x-run. For any RBW
$w \in \mathfrak{M}^0(q)$, the arity of $w$ is the number of $x_1, \ldots, x_q$ appearing in $w$, counted
with multiplicities. For example, the arity of an x-run is the total degree of
the monomial it represents, and an RBW such as $w = x_1x_2^2\lfloor x_1^4x_2^3x_1^2\rfloor$ has two
x-runs and arity 12.

Example 2.6 Let $A = k\langle x_1, \ldots, x_q\rangle$ be as in Example 2.4, let $v = (v_1, \ldots, v_q)$
be a vector of $q$ positive integers, and let $\mathfrak{a}$ be the bilateral ideal of $A$ generated
by the polynomials $x_i^{v_i+1} - x_i$, $1 \le i \le q$. Let $B$ be the quotient k-algebra
$A/\mathfrak{a}$. Writing $\bar{1} = 1 + \mathfrak{a}$ and $\bar{x}_i = x_i + \mathfrak{a}$, let $\bar{X}$ be the set consisting of all
non-commutative finite power products

\[ \bar{x}_{j_1}^{t_{j_1}} \bar{x}_{j_2}^{t_{j_2}} \cdots \bar{x}_{j_r}^{t_{j_r}} \]

in $\bar{x}_j$ ($1 \le j \le q$), where the indices satisfy $j_\ell \ne j_{\ell-1}$ and $j_\ell \ne j_{\ell+1}$ for all $\ell$
($2 \le \ell \le r - 1$), and the exponents satisfy $1 \le t_{j_\ell} \le v_{j_\ell}$ for $\ell$ ($1 \le \ell \le r$).
Then $\bar{X}$ is a k-basis of $B$ and $\text{Ш}^{NC,0}(B)$ is a free Rota-Baxter algebra on $B$.

Definition 2.7 The free (non-unitary) Rota-Baxter algebra $\text{Ш}^{NC,0}(B)$ of
Example 2.6 will be denoted by $\text{Ш}^{NC,0}(q, v)$ and be referred to as the free
(non-unitary) Rota-Baxter algebra on $q$ generators $\bar{x}_1, \ldots, \bar{x}_q$ with exponent vector
$v$. The corresponding k-basis $\mathfrak{M}^0(B)$ consisting of RBWs built from $\bar{X}$ will be
denoted by $\mathfrak{M}^0(q, v)$. Any non-commutative monomial $\bar{x} \in \bar{X}$ will be called an
x-run. For any RBW $w \in \mathfrak{M}^0(q, v)$, the arity of $w$ is the number of $\bar{x}_1, \ldots, \bar{x}_q$
appearing in a canonical representation of $w$ as an element of $S(\bar{X}')$, counted with
multiplicities. For example, the arity of an x-run is the total degree of the
monomial it represents, and an RBW such as $w = \bar{x}_1\bar{x}_2^2\lfloor \bar{x}_1^4\bar{x}_2^3\bar{x}_1^2\rfloor$ has two x-runs and
arity 12, provided $v_1 \ge 4$ and $v_2 \ge 3$.

In this paper, we will enumerate RBWs in $\mathfrak{M}^0(q, v)$ (actually $\mathfrak{M}^1(q, v)$, after
adjoining the empty RBW $\emptyset$) with a given degree and arity, by giving algorithms
to generate them and generating functions that count them. We begin with
$q = 1$ and $v_1 = 1$ in Section 3 under some extra hypothesis by restricting to
a subset of RBWs, but generalize the results to arbitrary $q$ and RBWs involving
multiple unary operators. For some of these generalizations, we note that the
corresponding free Rota-Baxter algebras have not been constructed and the
enumeration of the sets of RBWs is included for possible future applications.

3 One idempotent operator and one idempotent generator case 

In this section, we restrict ourselves to Example 2.6 when $q = 1$ and $v = (1)$,
that is, $\bar{x}$ is idempotent, and we further assume that the Rota-Baxter operator
$P$ is also idempotent (that is, $P(P(w)) = P(w)$ for all $w$). These restrictions
allow us to first focus on the word structures of free Rota-Baxter algebra 
constructions before considering other factors involved in more general Rota- 
Baxter words. Interestingly, in most applications of Rota-Baxter algebra in 
quantum field theory, the operators are idempotent. 
We contribute three results for the enumeration of a canonical basis of the free
Rota-Baxter algebra in this special case. After reviewing some preliminary
material and setting up notations, we consider generating functions based on
the degree of the Rota-Baxter words in Section 3.1. In Section 3.2, we refine
the study to consider generating functions based on the degree and arity. In
Section 3.3, we give an algorithm to generate this canonical basis with given
degree and arity.

For simplicity, we will drop the bar notation above the generator $x$. Under
our current hypothesis that both the single generator $x$ and the operator $P$
are idempotent, let $\mathfrak{R} = \mathfrak{R}_{1,1}$ be the subset of $\mathfrak{M}^1(X)$ consisting of $\emptyset$ and
Rota-Baxter words $w$ composed of $x$'s and pairs of balanced brackets such
that no two $x$'s are adjacent, and no two pairs of brackets can be immediately
adjacent or nested. In other words, the strings $\rfloor\lfloor$, $\lfloor\rfloor$, and strings of the form
$\lfloor\lfloor * \rfloor\rfloor$ where the brackets are balanced pairs and where $*$ may be any RBW, do
not appear as substrings of $w$. For example, the RBW $\lfloor\lfloor x\rfloor\rfloor x$ is not an element of
$\mathfrak{R}$. For the rest of this section, all RBWs are assumed to be in $\mathfrak{R}$.

3.1 Generating functions of one variable

Let $R(n)$ be the subset of $\mathfrak{R}$ of degree $n$ (with our convention, $R(0) = \{\emptyset, x\}$).
For $n > 0$, let $B(n)$ be the subset of $R(n)$ consisting of RBWs that begin with
a left bracket and end with a right bracket. Words in $B(n)$ are said to be
bracketed. By pre- or post-concatenating a bracketed RBW $w$ with $x$, we
get three new RBWs: $xw$, $wx$, and $xwx$, which are called respectively the left,
right, and bilateral associate of $w$. We also consider $x$ to be an associate
of the trivial word $\emptyset$. Any non-trivial RBW is either bracketed or an associate.
Thus for $n > 0$, the set $A(n)$ of all associates forms the complement of $B(n)$ in
$R(n)$ and it is the disjoint union of these cosets:

\[ A(n) = xB(n) \,\cup\, B(n)x \,\cup\, xB(n)x, \qquad n > 0. \]

The set of bracketed RBWs is further divided into two disjoint subsets. The first
subset $I(n)$ consists of all indecomposable bracketed RBWs, whose beginning
left bracket and ending right bracket are paired. The second subset $D(n)$
consists of all decomposable bracketed RBWs, whose beginning left bracket
and ending right bracket are not paired. For convenience in counting, we define
$B(0)$, $I(0)$, $D(0)$ to be the empty set and note that $A(0) = \{x\}$.
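The following table lists these various types of RBWs in lower degrees.

deg   I(n)                                                                                              D(n)                              A(n)                                                                      B(n)
0     $\emptyset$                                                                                       $\emptyset$                       $x$                                                                       $\emptyset$
1     $\lfloor x\rfloor$                                                                                $\emptyset$                       $x\lfloor x\rfloor$, $\lfloor x\rfloor x$, $x\lfloor x\rfloor x$          $\lfloor x\rfloor$
2     $\lfloor x\lfloor x\rfloor\rfloor$, $\lfloor\lfloor x\rfloor x\rfloor$, $\lfloor x\lfloor x\rfloor x\rfloor$    $\lfloor x\rfloor x\lfloor x\rfloor$    12 associates                                                  $I(2) \cup D(2)$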



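In terms of formal languages, we start with an alphabet $\Sigma$ of terminals
consisting of a special symbol $\emptyset$ and the three symbols $\lfloor$, $x$, and $\rfloor$, a set of
non-terminals consisting of $\langle$bracketed$\rangle$, $\langle$indecomposable$\rangle$, $\langle$decomposable$\rangle$,
$\langle$associate$\rangle$ and the sentence symbol $\langle$RBW$\rangle$. Let the production rules be:

\[
\begin{array}{rcll}
\langle\text{RBW}\rangle & \to & \emptyset \mid \langle\text{bracketed}\rangle \mid \langle\text{associate}\rangle & (4)\\
\langle\text{associate}\rangle & \to & x \mid x\langle\text{bracketed}\rangle \mid \langle\text{bracketed}\rangle x \mid x\langle\text{bracketed}\rangle x & (5)\\
\langle\text{bracketed}\rangle & \to & \langle\text{indecomposable}\rangle \mid \langle\text{decomposable}\rangle & (6)\\
\langle\text{indecomposable}\rangle & \to & \lfloor\langle\text{decomposable}\rangle\rfloor \mid \lfloor\langle\text{associate}\rangle\rfloor & (7)\\
\langle\text{decomposable}\rangle & \to & \langle\text{bracketed}\rangle\, x\, \langle\text{bracketed}\rangle & (8)
\end{array}
\]

By (5) and (7), it is clear that the sentences in this language will be RBWs and
vice versa. The production rules thus define a grammar whose language will
be $\mathfrak{R} = \bigcup_{n=0}^{\infty} R(n)$. For the benefit of readers unfamiliar with this area of computer
science, we refer them to Aho and Ullman (1972) for basics and now give a
detailed proof.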

Proof. Let $\mathcal{L}$ be the language defined by the grammar above. Clearly, $R(0) \subset \mathcal{L}$.
It is easy to see by induction on the length of the string representing an
RBW $w$ that $w \in \mathcal{L}$ in case $w$ is either indecomposable or an associate since
by Definition 2.1, removing the outermost pair of balanced brackets from an
indecomposable RBW or the appended $x$ (or $x$'s) from an associate will yield
an RBW of shorter length. If $w$ is bracketed and decomposable, then we may
write $w = \lfloor w'\rfloor$ where the beginning and ending brackets are not paired and
$w' \in S(X')$. Let $u$ be the left subword of $w$ ending in and including the $\rfloor$ that
balances the beginning $\lfloor$ of $w$. Then $u$ is an indecomposable RBW by definition.
Again by Definition 2.1, the next symbol in $w$ following $u$ must be $x$, which
is then followed by a $\lfloor$ that begins a bracketed RBW $v$ ending with the final $\rfloor$
of $w$. Thus we may write $w = uxv$ where $u$ is an indecomposable and $v$ a bracketed
RBW, both of shorter length, and by induction, $u$, $v$, and hence also $w$
(by Rule (8)), are in $\mathcal{L}$.

Conversely, to show that $\mathcal{L} \subset \mathfrak{R}$, it is only necessary to show that a sentence
production $\pi$ beginning with the rule $\langle$RBW$\rangle \to \langle$bracketed$\rangle$ will result in a
string $w$ that is a bracketed RBW (by Rules (4), (5), and Definition 2.1). The
shortest sentence production $\pi$ with this property is

\[ \langle\text{RBW}\rangle \to \langle\text{bracketed}\rangle \to \langle\text{indecomposable}\rangle \to \lfloor\langle\text{associate}\rangle\rfloor \to \lfloor x\rfloor \]

which results in an RBW. In general, we proceed by induction on the number of
productions applied in $\pi$. The possible continuations of $\langle$RBW$\rangle \to \langle$bracketed$\rangle$
are those leading to $\langle$decomposable$\rangle$, $\lfloor\langle$decomposable$\rangle\rfloor$, or $\lfloor\langle$associate$\rangle\rfloor$,
each of which in turn leads back to $\langle$bracketed$\rangle$. Since application of any of
the Rules (5), (7), or (8) will result in an RBW if the non-terminals involved
result in RBWs, the induction hypothesis and Definition 2.1 show that these
continuations will result in a valid RBW. □



Because $A(n)$ and $B(n)$ are disjoint, and $I(n)$ and $D(n)$ are disjoint, the
proof shows that given any RBW $w$, there is a unique sentence production
$\pi$ ending in $w$. Sentences derivable from $\langle$bracketed$\rangle$, $\langle$indecomposable$\rangle$,
$\langle$decomposable$\rangle$, and $\langle$associate$\rangle$ correspond bijectively respectively to RBWs
that are bracketed, indecomposable, decomposable, and associate.

For $n \ge 0$, let $r_n$ (resp. $a_n$, resp. $b_n$, resp. $i_n$, resp. $d_n$) be the number of all
(resp. associate, resp. bracketed, resp. indecomposable, resp. decomposable)
RBWs with $n$ pairs of (balanced) brackets. The first few values of $r_n$ for
$n = 0, 1, 2, \ldots, 5$ are

2, 4, 16, 80, 448, 2688, ...

which suggests that it is the sequence A025225 from Sloane et al. (2005) whose
$n$-th term is given by $2^{n+1}C_n$. Here $C_n = \frac{1}{n+1}\binom{2n}{n}$ is the $n$-th Catalan number.

By (4) and (5), we clearly have $r_n = 4b_n$ for $n > 0$, and hence it suffices to
prove that $b_n = 2^{n-1}C_n$. It is known that the sequence A003645 whose $n$-th
term is $2^{n-1}C_n$ ($n \ge 1$) has a generating series given by

\[ \sum_{n=1}^{\infty} 2^{n-1} C_n z^n = \frac{1 - 4z - \sqrt{1 - 8z}}{8z}. \tag{9} \]

We will prove that $b_n$ indeed has a generating series given by Eq. (9), and more.
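Theorem 3.1 The generating series for $r_n$, $b_n$, $i_n$, $d_n$ and $a_n$ are given by:

\[
\begin{aligned}
\mathbf{R}(z) &= \sum_{n=0}^{\infty} r_n z^n = \frac{1 - \sqrt{1 - 8z}}{2z}, \\
\mathbf{B}(z) &= \sum_{n=0}^{\infty} b_n z^n = \frac{1 - 4z - \sqrt{1 - 8z}}{8z}, \\
\mathbf{I}(z) &= \sum_{n=0}^{\infty} i_n z^n = \frac{1 - 2z - \sqrt{1 - 8z}}{2(z + 1)}, \\
\mathbf{D}(z) &= \sum_{n=0}^{\infty} d_n z^n = \frac{1 - 7z + 4z^2 + (3z - 1)\sqrt{1 - 8z}}{8z(z + 1)}, \\
\mathbf{A}(z) &= \sum_{n=0}^{\infty} a_n z^n = \frac{3 - 4z - 3\sqrt{1 - 8z}}{8z}.
\end{aligned}
\tag{10}
\]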



Corollary 3.2 The number of Rota-Baxter words of degree $n$ in the canonical
basis of the free Rota-Baxter algebra with a single idempotent generator and
idempotent operator is given by

\[ r_n = 2^{n+1} C_n, \qquad n = 0, 1, 2, \ldots \]

where $C_n = \frac{1}{n+1}\binom{2n}{n}$ is the $n$-th Catalan number.

Proof: We note the initial values of the sequences are given by:

\[
\begin{aligned}
r_0 &= 2, & b_0 &= 0, & i_0 &= 0, & d_0 &= 0, & a_0 &= 1, \\
r_1 &= 4, & b_1 &= 1, & i_1 &= 1, & d_1 &= 0, & a_1 &= 3.
\end{aligned}
\]

Thus, it does not matter whether the generating series for $\mathbf{B}(z)$, $\mathbf{I}(z)$, and
$\mathbf{D}(z)$ start at $n = 0$ or $n = 1$. By production rule (6), for all $n > 0$, we have
$b_n = i_n + d_n$. By production rule (7), an RBW $w$ is in $I(n)$ if and only if $w = \lfloor w'\rfloor$
for some $w' \in A(n-1)$ or $w' \in D(n-1)$. Since $a_{n-1} = 3b_{n-1}$ for $n \ge 2$, we get
$i_n = 3b_{n-1} + d_{n-1}$ and it follows that $i_n = 4b_{n-1} - i_{n-1}$, or

\[ i_n + i_{n-1} = 4b_{n-1}. \]

Multiply both sides by $z^n$ and sum over $n \ge 2$. The left hand side gives

\[ \sum_{n=2}^{\infty} i_n z^n + \sum_{n=2}^{\infty} i_{n-1} z^n = \mathbf{I}(z) - z + z\,\mathbf{I}(z). \]

For the right hand side, note that by production rules (8) and (6), every
bracketed, decomposable RBW is formed from bracketed, indecomposable RBWs
with $x$ inserted between them. This decomposition is clearly unique and the
degree of the decomposable RBW is the sum of the degrees of the component
indecomposable ones. Hence we have

\[ d_n = \sum_{\substack{(n_1, \ldots, n_p;\, n) \\ p > 1}} i_{n_1} \cdots i_{n_p} \tag{11} \]

where the notation $(n_1, \ldots, n_p; n)$ denotes all compositions $n_1 + \cdots + n_p = n$
of $n$ into $p$ positive integers, and the sum is over all compositions for all
lengths $p > 1$. Therefore, noting the special case $p = 1$ corresponds to a single
summand $i_n$, we have

\[ b_n = i_n + d_n = \sum_{\substack{(n_1, \ldots, n_p;\, n) \\ p \ge 1}} i_{n_1} \cdots i_{n_p} \qquad (n \ge 1). \tag{12} \]

From this, we obtain

\[
\mathbf{B}(z) = \sum_{n=1}^{\infty} b_n z^n
= \sum_{n=1}^{\infty} \Bigl( \sum_{\substack{(n_1, \ldots, n_p;\, n) \\ p \ge 1}} i_{n_1} \cdots i_{n_p} \Bigr) z^n
= \sum_{p=1}^{\infty} \Bigl( \sum_{k=1}^{\infty} i_k z^k \Bigr)^{p}
= \frac{\mathbf{I}(z)}{1 - \mathbf{I}(z)}. \tag{13}
\]

Since

\[ 4 \sum_{n=2}^{\infty} b_{n-1} z^n = 4z\,\mathbf{B}(z), \]

we have

\[ \mathbf{I}(z) - z + z\,\mathbf{I}(z) = \frac{4z\,\mathbf{I}(z)}{1 - \mathbf{I}(z)}. \tag{14} \]

Solving for $\mathbf{I}(z)$ with the initial condition $i_0 = 0$, we have

\[ \mathbf{I}(z) = \frac{1 - 2z - \sqrt{1 - 8z}}{2(1 + z)}. \]

This proves the third formula of Eq. (10) and by Eq. (13), after rationalizing
the denominator, we obtain the second formula of Eq. (10). The other formulas
are easily derived from the relations $\mathbf{D}(z) = \mathbf{B}(z) - \mathbf{I}(z)$, $\mathbf{A}(z) = 1 + 3\mathbf{B}(z)$,
and $\mathbf{R}(z) = 1 + \mathbf{B}(z) + \mathbf{A}(z)$. The corollary comes from the remarks before the
theorem. □
Theorem 3.1 shows that the number $i_n$ of bracketed indecomposable RBWs of
degree $n$ ($n \ge 1$) is the $n$-th term of the sequence A062992:

1, 3, 13, 67, 381, 2307, ...

and the number $d_n$ of bracketed decomposable RBWs of degree $n$ ($n \ge 2$) is a
new sequence A115194, which starts with

1, 7, 45, 291, 1917, 12867, ....
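
As a numerical cross-check, the relations used in the proof of Theorem 3.1 can be iterated directly. The sketch below is an illustration under stated assumptions, not part of the original argument: it uses $i_n = 4b_{n-1} - i_{n-1}$ for $n \ge 2$ and reads Eq. (13) in the equivalent form $\mathbf{B} = \mathbf{I} + \mathbf{I}\,\mathbf{B}$, coefficient by coefficient. The function names are hypothetical.

```python
from math import comb


def catalan(n):
    """n-th Catalan number C_n = binom(2n, n) / (n + 1)."""
    return comb(2 * n, n) // (n + 1)


def sequences(n_max):
    """Return lists (b, i, d) indexed by degree 0..n_max.

    Uses i_n = 4*b_{n-1} - i_{n-1} (n >= 2) and the coefficient form of
    B = I + I*B, that is b_n = i_n + sum_{k=1}^{n-1} i_k * b_{n-k}.
    """
    b = [0] * (n_max + 1)
    i = [0] * (n_max + 1)
    d = [0] * (n_max + 1)
    if n_max >= 1:
        b[1] = i[1] = 1          # the single word [x] of degree 1
    for n in range(2, n_max + 1):
        i[n] = 4 * b[n - 1] - i[n - 1]
        b[n] = i[n] + sum(i[k] * b[n - k] for k in range(1, n))
        d[n] = b[n] - i[n]
    return b, i, d


if __name__ == "__main__":
    b, i, d = sequences(7)
    print("i_n:", i[1:7])
    print("d_n:", d[2:8])
    print("b_n matches 2^(n-1) C_n:",
          all(b[n] == 2 ** (n - 1) * catalan(n) for n in range(1, 8)))
```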



3.2 Generating functions of two variables 



In our computational experiment, we observed that the set $B(n)$, when stratified
by the number of $x$'s appearing in an RBW, possesses certain very nice properties
that may give better combinatorial understanding of how the canonical
basis is constructed recursively (see the algorithm in the next subsection). To
describe the stratification, for any RBW $w$, recall (Definition 2.7) that $w$ has
arity $m$ if the number of $x$'s appearing in the string representation of $w$ is
exactly $m$. For any $m \ge 0$, let $R(n, m)$ be the subset of $\mathfrak{R}$ of degree $n$ and arity
$m$, and define similarly the notations $A(n, m)$, $B(n, m)$, $I(n, m)$, and $D(n, m)$.
These are all finite sets. Let their sizes be respectively denoted by $r_{n,m}$, $a_{n,m}$,
$b_{n,m}$, $i_{n,m}$ and $d_{n,m}$. For initial values, we have

\[
\begin{aligned}
& r_{0,0} = 1; \quad a_{0,0} = b_{0,0} = i_{0,0} = d_{0,0} = 0; \\
& r_{0,1} = a_{0,1} = 1; \quad b_{0,1} = i_{0,1} = d_{0,1} = 0; \\
& r_{1,1} = b_{1,1} = i_{1,1} = 1; \quad a_{1,1} = d_{1,1} = 0; \\
& r_{1,2} = a_{1,2} = 2; \quad b_{1,2} = i_{1,2} = d_{1,2} = 0; \\
& r_{1,3} = a_{1,3} = 1; \quad b_{1,3} = i_{1,3} = d_{1,3} = 0; \\
& r_{0,m} = a_{0,m} = b_{0,m} = i_{0,m} = d_{0,m} = 0 \quad \text{for } m \ge 2; \\
& r_{1,m} = a_{1,m} = b_{1,m} = i_{1,m} = d_{1,m} = 0 \quad \text{for } m \ge 4; \\
& r_{n,0} = a_{n,0} = b_{n,0} = i_{n,0} = d_{n,0} = 0 \quad \text{for } n \ge 1; \\
& r_{n,1} = a_{n,1} = b_{n,1} = i_{n,1} = d_{n,1} = 0 \quad \text{for } n \ge 2.
\end{aligned}
\]



From the production rules (4)-(8), we see that for $n \ge 1$, $m \ge 2$:

\[ a_{n,m} = 2\,b_{n,m-1} + b_{n,m-2} \tag{15} \]
\[ b_{n,m} = i_{n,m} + d_{n,m} \tag{16} \]
\[ i_{n,m} = d_{n-1,m} + a_{n-1,m} \tag{17} \]

Now for $n \ge 2$, $m \ge 2$ and any $w \in D(n, m)$, we can write $w$ uniquely as
$w_{n_1} x w_{n_2} \cdots x w_{n_p}$ where $w_{n_j} \in I(n_j)$ and $n_1 + \cdots + n_p$ is a composition of $n$ using
$p$ positive integers. Let $m_j$ be the arity of $w_{n_j}$. Then clearly, $m_1 + \cdots + m_p =
m - p + 1$ and so we may refine Eq. (11) to:

\[ d_{n,m} = \sum_{p=2}^{\min(n,m)} \ \sum_{(m_1, \ldots, m_p;\, m-p+1)} \ \sum_{(n_1, \ldots, n_p;\, n)} (i_{n_1,m_1}) \cdots (i_{n_p,m_p}) \tag{18} \]

and noting that the case $p = 1$ corresponds to a single summand $i_{n,m}$, refine
Eq. (12) to:

\[ b_{n,m} = i_{n,m} + d_{n,m} = \sum_{p=1}^{\min(n,m)} \ \sum_{(m_1, \ldots, m_p;\, m-p+1)} \ \sum_{(n_1, \ldots, n_p;\, n)} (i_{n_1,m_1}) \cdots (i_{n_p,m_p}). \tag{19} \]

Now from Eqs. (15)-(17), we have

\[
\begin{aligned}
i_{n,m} &= d_{n-1,m} + a_{n-1,m} \\
&= b_{n-1,m} - i_{n-1,m} + 2\,b_{n-1,m-1} + b_{n-1,m-2}
\end{aligned}
\]

and so

\[ i_{n,m} + i_{n-1,m} = b_{n-1,m} + 2\,b_{n-1,m-1} + b_{n-1,m-2}. \tag{20} \]

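Define the bivariate generating series

\[ \mathbf{R}(z, t) = \sum_{n=0}^{\infty} \sum_{m=0}^{\infty} r_{n,m} z^n t^m \]

and similarly define $\mathbf{B}(z, t)$, $\mathbf{I}(z, t)$, $\mathbf{D}(z, t)$, and $\mathbf{A}(z, t)$. Note that for $\mathbf{B}(z, t)$,
$\mathbf{I}(z, t)$, and $\mathbf{D}(z, t)$, it does not matter whether the series indices $n, m$ start at 0
or 1. Multiplying both sides of Eq. (20) by $z^n t^m$ and summing up for $n \ge 2$, $m \ge 2$,
the left hand side gives:

\[
\begin{aligned}
\sum_{n=2}^{\infty} \sum_{m=2}^{\infty} i_{n,m} z^n t^m + \sum_{n=2}^{\infty} \sum_{m=2}^{\infty} i_{n-1,m} z^n t^m
&= \mathbf{I}(z, t) - zt + z \sum_{n=1}^{\infty} \Bigl( \sum_{m=1}^{\infty} i_{n,m} t^m - i_{n,1} t \Bigr) z^n \\
&= \mathbf{I}(z, t) - zt - z^2 t + z \sum_{n=1}^{\infty} \sum_{m=1}^{\infty} i_{n,m} z^n t^m \\
&= \mathbf{I}(z, t) - zt - z^2 t + z\,\mathbf{I}(z, t).
\end{aligned}
\]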
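Now, we sum the right hand side of Eq. (20) one term at a time.

\[
\begin{aligned}
\sum_{n=2}^{\infty} \sum_{m=2}^{\infty} b_{n-1,m} z^n t^m
&= z \sum_{n=1}^{\infty} \sum_{m=2}^{\infty} b_{n,m} z^n t^m
= z\,\mathbf{B}(z, t) - z^2 t, \\
\sum_{n=2}^{\infty} \sum_{m=2}^{\infty} 2\,b_{n-1,m-1} z^n t^m
&= 2zt \sum_{n=1}^{\infty} \sum_{m=1}^{\infty} b_{n,m} z^n t^m
= 2zt\,\mathbf{B}(z, t), \\
\sum_{n=2}^{\infty} \sum_{m=2}^{\infty} b_{n-1,m-2} z^n t^m
&= zt^2 \sum_{n=1}^{\infty} \sum_{m=0}^{\infty} b_{n,m} z^n t^m
= zt^2 \Bigl( \mathbf{B}(z, t) + \sum_{n=1}^{\infty} b_{n,0} z^n \Bigr)
= zt^2\,\mathbf{B}(z, t).
\end{aligned}
\]

Hence the right hand side sums to $z(1 + t)^2\,\mathbf{B}(z, t) - z^2 t$, giving the identity

\[ (1 + z)\,\mathbf{I}(z, t) - zt = z(1 + t)^2\,\mathbf{B}(z, t). \tag{21} \]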
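Using Eq. (19), we have

\[
\begin{aligned}
\mathbf{B}(z, t) &= \sum_{n=1}^{\infty} \sum_{m=1}^{\infty} b_{n,m} z^n t^m \\
&= \sum_{n=1}^{\infty} \sum_{m=1}^{\infty} \sum_{p=1}^{\min(n,m)} \ \sum_{(m_1, \ldots, m_p;\, m-p+1)} \ \sum_{(n_1, \ldots, n_p;\, n)} (i_{n_1,m_1}) \cdots (i_{n_p,m_p})\, z^n t^m \\
&= \sum_{n=1}^{\infty} \sum_{m=1}^{\infty} \sum_{p=1}^{\min(n,m)} \ \sum_{(m_1, \ldots, m_p;\, m-p+1)} \ \sum_{(n_1, \ldots, n_p;\, n)} (i_{n_1,m_1} z^{n_1} t^{m_1}) \cdots (i_{n_p,m_p} z^{n_p} t^{m_p})\, t^{p-1} \\
&= \sum_{p=1}^{\infty} \Bigl( \sum_{k=1}^{\infty} \sum_{\ell=1}^{\infty} i_{k,\ell} z^k t^{\ell} \Bigr)^{p} t^{p-1}
\end{aligned}
\]

and hence

\[ \mathbf{B}(z, t) = \frac{\mathbf{I}(z, t)}{1 - t\,\mathbf{I}(z, t)}. \tag{22} \]

Thus we obtained the identity defining $\mathbf{I}(z, t)$ as

\[ (1 + z)\,\mathbf{I}(z, t) - zt = z(1 + t)^2\, \frac{\mathbf{I}(z, t)}{1 - t\,\mathbf{I}(z, t)}. \tag{23} \]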
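Solving this quadratic equation in $\mathbf{I}(z, t)$ and using the initial conditions, we
found

\[ \mathbf{I}(z, t) = \frac{1 - 2zt - \sqrt{1 - 4zt - 4zt^2}}{2t(1 + z)}, \tag{24} \]

\[ \mathbf{B}(z, t) = \frac{1 - 2zt - 2zt^2 - \sqrt{1 - 4zt - 4zt^2}}{2t(1 + t)^2 z}. \tag{25} \]

We can also obtain the bivariate generating series for $a_{n,m}$.

\[
\begin{aligned}
\mathbf{A}(z, t) &= \sum_{n=0}^{\infty} \Bigl( a_{n,0} + a_{n,1} t + \sum_{m=2}^{\infty} a_{n,m} t^m \Bigr) z^n \\
&= t + \sum_{m=2}^{\infty} a_{0,m} t^m + \sum_{n=1}^{\infty} \Bigl( \sum_{m=2}^{\infty} a_{n,m} t^m \Bigr) z^n \\
&= t + \sum_{n=1}^{\infty} \sum_{m=2}^{\infty} (2\,b_{n,m-1} + b_{n,m-2})\, z^n t^m \\
&= t + \sum_{n=1}^{\infty} t \Bigl( \sum_{m=1}^{\infty} 2\,b_{n,m} z^n t^m \Bigr) + t^2 \sum_{n=1}^{\infty} \sum_{m=0}^{\infty} b_{n,m} z^n t^m \\
&= t + 2t\,\mathbf{B}(z, t) + t^2\,\mathbf{B}(z, t).
\end{aligned}
\tag{26}
\]

The bivariate generating function $\mathbf{A}(z, t)$ is given by

\[ \mathbf{A}(z, t) = \frac{2 + t - 2zt - 2zt^2 - (2 + t)\sqrt{1 - 4zt - 4zt^2}}{2z(1 + t)^2}. \tag{27} \]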

Interesting sequences and counting information can be derived from these
functions. For example, the power series expansion for $\mathbf{A}(z, t)$ in low degrees
in $z$, with coefficients in $t$ accurate to high order, is

\[ t + (2t^2 + t^3)z + (4t^3 + 6t^4 + 2t^5)z^2 + (10t^4 + 25t^5 + 20t^6 + 5t^7)z^3 + \cdots. \]

From this, we can read off that there are 60 associate RBWs in $\mathfrak{R}$ with three
pairs of brackets, and 10 of these have arity 4, 25 have arity 5, 20 have arity
6, and 5 have arity 7. By expanding the series using $t$ as the main variable,
with coefficients in $z$ accurate to high order, we have

\[ t + 2zt^2 + (z + 4z^2)t^3 + (6z^2 + 10z^3)t^4 + (2z^2 + 25z^3 + 28z^4)t^5 + \cdots. \]

From this we see that there are 55 associate RBWs in $\mathfrak{R}$ with arity 5, and 2 of
these have degree 2, 25 have degree 3 and 28 have degree 4. By specializing
$z = 1$, we obtain a sequence for the number of associates in $\mathfrak{R}$ with arity $m$
($m = 1, 2, \ldots$):

1, 2, 5, 16, 55, 202, 773, 3052, ...

This sequence is new and not in the Sloane database.
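
The coefficients quoted above can be recomputed degree by degree without expanding any square root. The following sketch is an illustration only; it assumes three consequences of Eqs. (21), (22) and (15): for $n \ge 2$ the coefficient of $z^n$ in Eq. (21) gives $I_n + I_{n-1} = (1+t)^2 B_{n-1}$, Eq. (22) is equivalent to $\mathbf{B} = \mathbf{I} + t\,\mathbf{I}\,\mathbf{B}$, and Eq. (15) gives $A_n = (2t + t^2) B_n$ for $n \ge 1$, where $I_n$, $B_n$, $A_n$ denote the coefficients of $z^n$ as polynomials in $t$. Polynomials are stored as coefficient lists, and all function names are hypothetical.

```python
def padd(p, q):
    """Sum of two polynomials in t given as coefficient lists."""
    n = max(len(p), len(q))
    return [(p[k] if k < len(p) else 0) + (q[k] if k < len(q) else 0)
            for k in range(n)]


def pmul(p, q):
    """Product of two polynomials in t given as coefficient lists."""
    out = [0] * (len(p) + len(q) - 1)
    for a, pa in enumerate(p):
        for c, qc in enumerate(q):
            out[a + c] += pa * qc
    return out


def degree_polynomials(n_max):
    """Return dicts I, B, A mapping a degree n >= 1 to a coefficient list in t."""
    I = {1: [0, 1]}            # the word [x]: degree 1, arity 1
    B = {1: [0, 1]}
    for n in range(2, n_max + 1):
        # Eq. (21) at z^n:  I_n + I_{n-1} = (1 + t)^2 * B_{n-1}
        I[n] = padd(pmul([1, 2, 1], B[n - 1]), [-c for c in I[n - 1]])
        # Eq. (22) as B = I + t*I*B, read off at z^n
        conv = [0]
        for k in range(1, n):
            conv = padd(conv, pmul(I[k], B[n - k]))
        B[n] = padd(I[n], [0] + conv)
    A = {n: pmul([0, 2, 1], B[n]) for n in B}   # A_n = (2t + t^2) * B_n
    return I, B, A


def associates_by_arity(m_max):
    """Number of associates of arity 1..m_max, summed over all degrees."""
    _, _, A = degree_polynomials(m_max)   # arity m forces degree <= m - 1
    totals = [0] * (m_max + 1)
    totals[1] = 1                          # the associate x of degree 0
    for coeffs in A.values():
        for m, c in enumerate(coeffs[: m_max + 1]):
            totals[m] += c
    return totals[1:]


if __name__ == "__main__":
    _, _, A = degree_polynomials(3)
    print("degree 3, arities 4..7:", A[3][4:8])
    print("associates by arity:", associates_by_arity(6))
```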

Theorem 3.3 The generating series for $r_{n,m}$ is

\[ \mathbf{R}(z, t) = \frac{1 - \sqrt{1 - 4zt - 4zt^2}}{2tz}. \tag{28} \]

Proof: This follows by Rule (4) using $\mathbf{R}(z, t) = 1 + \mathbf{B}(z, t) + \mathbf{A}(z, t)$ and Eqs.
(25) and (27). □



Once again we have proved Theorem 3.1 (by putting $t = 1$ in Eqs. (24)-(28)).
By specializing $z = 1$ in Eq. (28), we obtain the sequence A025227:

1, 2, 4, 12, 40, 144, 544, 2128, ...,

and thus give that sequence a new combinatorial interpretation. We easily obtain
new sequences by specializing to other values, such as for $z = 2, 3, 4, 5, \ldots$:

z = 2: 1, 3, 12, 66, 408, 2712, 18912, 136488, ...,

z = 3: 1, 4, 24, 192, 1728, 16704, 169344, ...,

z = 4: 1, 5, 40, 420, 4960, 62880, 835840, ...,

z = 5: 1, 6, 60, 780, 11400, 178800, 2940000, ....
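
These specializations are easy to recompute. Clearing the square root in Eq. (28) shows that $\mathbf{R}(z, t)$ satisfies the functional equation $\mathbf{R} = (1 + t) + zt\,\mathbf{R}^2$, so for a fixed numerical value of $z$ the coefficients $c_m$ of $t^m$ obey $c_0 = 1$ and $c_m = [m = 1] + z \sum_{j=0}^{m-1} c_j c_{m-1-j}$. The following sketch implements this recurrence; it is offered as a check under that assumption, and the name `specialized_counts` is hypothetical.

```python
def specialized_counts(z, terms):
    """First `terms` coefficients in t of R(z, t) for a fixed integer z.

    Based on the functional equation R = (1 + t) + z*t*R^2, which is
    equivalent to the closed form with the square root.
    """
    c = [1]
    for m in range(1, terms):
        conv = sum(c[j] * c[m - 1 - j] for j in range(m))   # [t^(m-1)] of R^2
        c.append((1 if m == 1 else 0) + z * conv)
    return c


if __name__ == "__main__":
    for z in (1, 2, 3, 4, 5):
        print("z =", z, specialized_counts(z, 7))
```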

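Moreover, this result is more refined than Theorem 3.1 and its corollary.
Indeed, we note that Eq. (28) is related to the well-known generating series

\[ \mathbf{C}(z) = \sum_{n=0}^{\infty} C_n z^n = \frac{1 - \sqrt{1 - 4z}}{2z} \tag{29} \]

for the Catalan sequence $C_n$. We have clearly:

\[
\begin{aligned}
\mathbf{R}(z, t) &= \frac{1 - \sqrt{1 - 4zt(1 + t)}}{2zt} = (1 + t)\,\mathbf{C}(zt(1 + t)) \\
&= \sum_{n=0}^{\infty} C_n z^n t^n (1 + t)^{n+1} \\
&= \sum_{n=0}^{\infty} \Bigl( \sum_{i=0}^{n+1} \binom{n+1}{i} t^{n+i} \Bigr) C_n z^n
\end{aligned}
\]

and hence

\[
r_{n,m} =
\begin{cases}
\binom{n+1}{m-n}\, C_n & \text{if } n \le m \le 2n + 1,\ n \ge 0; \\
0 & \text{otherwise.}
\end{cases}
\tag{30}
\]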



This result not only provides the proof anew that $R(n)$ has $2^{n+1}C_n$ RBWs, but
also that these are distributed by their arities from $n$ to $2n + 1$ according to the
binomial theorem. In a similar fashion, using Eq. (25) and a modified version
of Eq. (29):

\[ \sum_{n=1}^{\infty} C_n z^n = \frac{1 - 2z - \sqrt{1 - 4z}}{2z}, \]

the doubly-indexed sequence for the number of bracketed RBWs with degree $n$
and arity $m$ has the same property, that is,

\[
b_{n,m} =
\begin{cases}
\binom{n-1}{m-n}\, C_n & \text{if } n \le m \le 2n - 1,\ n \ge 1; \\
0 & \text{otherwise.}
\end{cases}
\tag{31}
\]



This latter distribution, like the one for $R(n,m)$, was first observed by
experimental computations, but the proof is not obvious because among the $C_n$
ways to set up the structure of $n$ pairs of balanced brackets, the number of
ways to insert $m$ $x$'s to form bracketed RBWs (or RBWs in the case of $R(n,m)$)
depends on the individual bracket structure (and sometimes, this number can
be zero). An example that illustrates this observation is the set $B(3,4)$ which
has the following 10 elements. The $C_3 = 5$ possible bracket structures (which
correspond to the 5 possible rooted trees with 4 vertices) are shown on the
left.

structure    count   bracketed RBWs
⌊⌊⌊⌋⌋⌋       4       ⌊x⌊x⌊x⌋⌋x⌋, ⌊x⌊⌊x⌋x⌋x⌋, ⌊x⌊x⌊x⌋x⌋⌋, ⌊⌊x⌊x⌋x⌋x⌋
⌊⌊⌋⌊⌋⌋       2       ⌊x⌊x⌋x⌊x⌋⌋, ⌊⌊x⌋x⌊x⌋x⌋
⌊⌊⌋⌋⌊⌋       2       ⌊x⌊x⌋⌋x⌊x⌋, ⌊⌊x⌋x⌋x⌊x⌋
⌊⌋⌊⌊⌋⌋       2       ⌊x⌋x⌊x⌊x⌋⌋, ⌊x⌋x⌊⌊x⌋x⌋
⌊⌋⌊⌋⌊⌋       0       (none)








3.3 Algorithm for generating R 



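It is also interesting to note that the two-variable generating functions studied
above allow us to obtain a new and more effective way to generate, say, $B(n,m)$
recursively, from constructions using bracketed RBWs alone. From Eq. (21) and
Eq. (22), we obtain the identity satisfied by $\mathbf{B}(z,t)$:

\[ \mathbf{B}(z,t) - zt = 2zt(1+t)\,\mathbf{B}(z,t) + zt(1+t)^2\,\mathbf{B}(z,t)^2. \tag{32} \]

Substituting the definition of $\mathbf{B}(z,t)$ into the above yields:

\[
\sum_{n=1}^{\infty}\sum_{m=1}^{\infty} b_{n,m} z^n t^m - zt
= 2zt(1+t)\Bigl(\sum_{n=1}^{\infty}\sum_{m=1}^{\infty} b_{n,m} z^n t^m\Bigr)
+ zt(1+t)^2\Bigl(\sum_{n=1}^{\infty}\sum_{m=1}^{\infty} b_{n,m} z^n t^m\Bigr)^2,
\]

from which we obtain, for $(n,m) \ne (1,1)$,

\[
b_{n,m} = 2\,b_{n-1,m-1} + 2\,b_{n-1,m-2}
+ \sum_{\substack{(n_1,n_2;\,n-1)\\(m_1,m_2;\,m-1)}} b_{n_1,m_1} b_{n_2,m_2}
+ 2\sum_{\substack{(n_1,n_2;\,n-1)\\(m_1,m_2;\,m-2)}} b_{n_1,m_1} b_{n_2,m_2}
+ \sum_{\substack{(n_1,n_2;\,n-1)\\(m_1,m_2;\,m-3)}} b_{n_1,m_1} b_{n_2,m_2}
\tag{33}
\]

or, in explicit summation form:

\[
\begin{aligned}
b_{n,m} = {} & 2\,b_{n-1,m-1} + 2\,b_{n-1,m-2}
+ \sum_{k=1}^{n-2}\sum_{\ell=1}^{m-2} b_{k,\ell}\, b_{n-1-k,\,m-1-\ell} \\
& + 2\sum_{k=1}^{n-2}\sum_{\ell=1}^{m-3} b_{k,\ell}\, b_{n-1-k,\,m-2-\ell}
+ \sum_{k=1}^{n-2}\sum_{\ell=1}^{m-4} b_{k,\ell}\, b_{n-1-k,\,m-3-\ell}.
\end{aligned}
\tag{34}
\]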



This recursion suggests that we can generate $B(n,m)$ (and hence also $R(n,m)$)
efficiently and irredundantly from sets $B(k,\ell)$ with $k < n$, $\ell < m$ in some
manner. This is indeed the case.
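
Before turning to the construction, the counting recursion itself can be checked numerically. The sketch below is illustrative only: it evaluates $b_{n,m}$ by the explicit summation form of Eq. (34), taking $b_{1,1} = 1$ and $b_{1,m} = 0$ for $m \ne 1$ as base values, and compares the result with the closed form $\binom{n-1}{m-n} C_n$ of Eq. (31). The names `b_rec` and `b_closed` are hypothetical.

```python
from functools import lru_cache
from math import comb


@lru_cache(maxsize=None)
def b_rec(n, m):
    """b_{n,m} computed from the recursion (34)."""
    if n < 1 or m < 1:
        return 0
    if n == 1:
        return 1 if m == 1 else 0  # the only bracketed RBW of degree 1 is [x]
    total = 2 * b_rec(n - 1, m - 1) + 2 * b_rec(n - 1, m - 2)
    for k in range(1, n - 1):          # k = 1 .. n-2
        for l in range(1, m - 1):      # l = 1 .. m-2; out-of-range terms are 0
            total += b_rec(k, l) * (
                b_rec(n - 1 - k, m - 1 - l)
                + 2 * b_rec(n - 1 - k, m - 2 - l)
                + b_rec(n - 1 - k, m - 3 - l)
            )
    return total


def b_closed(n, m):
    """Closed form (31): binom(n-1, m-n) * C_n for n <= m <= 2n-1, else 0."""
    if not (1 <= n <= m <= 2 * n - 1):
        return 0
    return comb(n - 1, m - n) * comb(2 * n, n) // (n + 1)


if __name__ == "__main__":
    print([b_rec(4, m) for m in range(4, 8)])
```

The three double sums of Eq. (34) are merged into one loop; the terms beyond the stated upper limits vanish because $b_{n,m} = 0$ whenever $m < 1$.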

Theorem 3.4 Given positive integers $n$ and $m$, the algorithm below returns
the sets $B(n,m)$ (resp. $I(n,m)$, resp. $D(n,m)$) of bracketed (resp. indecomposable,
resp. decomposable) RBWs of degree $n$ and arity $m$. For $n \ge 4$, $B(n,m)$
is expressed as the disjoint union of sets constructed from $B(k,\ell)$ with $k < n$
and $\ell < m$ according to Eq. (34).

Algorithm for bracketed RBWs of degree n and arity m

Input: Positive integers $n$, $m$
Output:

(a) the set $B(n,m)$ of bracketed RBWs of degree $n$ and arity $m$,

(b) the set $I(n,m)$ of indecomposable RBWs of degree $n$ and arity $m$,

(c) the set $D(n,m)$ of decomposable RBWs of degree $n$ and arity $m$.

Step 0. If not ($n \le m \le 2n-1$), then return three empty sets.
        Generate all bracketed (resp. indecomposable, resp. decomposable)
        RBWs with degree $k \le 3$ and arity $\ell$ between $k$ and $2k-1$.
        If $n \le 3$ then return $B(n,m)$, $I(n,m)$, $D(n,m)$.

Step 1. For each RBW $w \in B(n-1, m-1)$, form two RBWs
        $f_{1,1}(w) = \lfloor xw \rfloor$ and $f_{1,2}(w) = \lfloor wx \rfloor$.

Step 2. For each RBW $u \in B(n-1, m-2)$, form the RBW $f_2(u) = \lfloor xux \rfloor$.

Step 3. For each $k = 1, \ldots, n-2$, each $\ell = 1, \ldots, m-2$,
        and each pair of RBWs $(v,y) \in I(k,\ell) \times B(n-1-k, m-1-\ell)$,
        form the RBW $f_3(v,y) = \lfloor vxy \rfloor$.

Step 4. For each $k = 1, \ldots, n-2$, each $\ell = 1, \ldots, m-2$,
        and each pair of RBWs $(v,y) \in D(k,\ell) \times B(n-1-k, m-1-\ell)$,
        form the RBW $f_4(v,y) = \lfloor v \rfloor xy$.

Step 5. For each RBW $u \in B(n-1, m-2)$, form the RBW $f_5(u) = \lfloor x \rfloor xu$.

Step 6. For each $k = 1, \ldots, n-2$, each $\ell = 1, \ldots, m-3$,
        and each pair of RBWs $(v,y) \in B(k,\ell) \times B(n-1-k, m-2-\ell)$,
        form the two RBWs $f_{6,1}(v,y) = \lfloor xv \rfloor xy$ and $f_{6,2}(v,y) = \lfloor vx \rfloor xy$.

Step 7. For each $k = 1, \ldots, n-2$, each $\ell = 1, \ldots, m-4$,
        and each pair of RBWs $(v,y) \in B(k,\ell) \times B(n-1-k, m-3-\ell)$,
        form the RBW $f_7(v,y) = \lfloor xvx \rfloor xy$.

Step 8. Return the union of all the RBWs formed in Steps 1-3 as $I(n,m)$,
        the union of all the RBWs formed in Steps 4-7 as $D(n,m)$,
        and the union of $I(n,m)$ and $D(n,m)$ as $B(n,m)$.

Proof: By Eq. (32), we know that $B(n,m)$, $I(n,m)$, and $D(n,m)$ are empty
if $m < n$ or $m > 2n-1$. The cases when $n \le 3$ are taken care of in Step 0.
So suppose $n \ge 4$ and $n \le m \le 2n-1$. We first note the disjoint union
$B(n,m) = I(n,m) \cup D(n,m)$ from Eq. (6).

Any word $z$ in $I(n,m)$ is of the form $\lfloor z' \rfloor$ where $z' \in R(n-1,m)$ by Eq. (7).
Then one and exactly one of the following statements on $z'$ is true.

Case-1. Either $z'$ starts with $x$ but does not end with $x$, implying
        $z = \lfloor xw \rfloor$ with $w \in B(n-1, m-1)$; or
        $z'$ does not start with $x$ but ends with $x$, implying $z = \lfloor wx \rfloor$
        with $w \in B(n-1, m-1)$;

Case-2. $z'$ starts with $x$ and ends with $x$, implying $z = \lfloor xux \rfloor$ with
        $u \in B(n-1, m-2)$;

Case-3. $z'$ neither starts nor ends with $x$. Then $z'$ must be decomposable
        and so $z' \in D(n-1,m)$. Let $v$ be the leftmost indecomposable
        subword of $z'$ (see Eq. (8)) and let $k$ be the degree of
        $v$ and $\ell$ be the arity of $v$. Then $1 \le k \le n-2$, $1 \le \ell \le m-2$,
        and $v \in I(k,\ell)$. Moreover, we can write $z'$ uniquely as $vxy$
        and $z = \lfloor vxy \rfloor$ where $y \in B(n-1-k, m-1-\ell)$.

This proves that $I(n,m)$ is a subset of the set of RBWs generated by Steps
1-3. Conversely, any RBW $z$ generated by Steps 1-3 clearly belongs to $I(n,m)$
by definition. Thus $I(n,m)$ is precisely the set of RBWs generated by Steps
1-3, and moreover, the sets generated in each of these steps are disjoint. This
shows that

\[
i_{n,m} = 2\,b_{n-1,m-1} + b_{n-1,m-2}
+ \sum_{k=1}^{n-2}\sum_{\ell=1}^{m-2} i_{k,\ell}\, b_{n-1-k,\,m-1-\ell}.
\tag{35}
\]

Now consider $z \in D(n,m)$. As in Case-3 above, $z$ is of the form $\lfloor z' \rfloor xy$ for
a unique RBW $z'$ ($\lfloor z' \rfloor$ being the leftmost indecomposable subword of $z$) and a
unique bracketed RBW $y$. Let $k$ be the degree of $z'$. Then $1 \le k \le n-2$, and
one and exactly one of the following statements on $z'$ is true.

Case-4. $z'$ neither starts nor ends with an $x$, implying that $z'$ is
        decomposable. Denoting $z'$ by $v$, and letting $\ell$ be the arity
        of $v$, we have $z = \lfloor v \rfloor xy$ with
        $(v,y) \in D(k,\ell) \times B(n-1-k, m-1-\ell)$,
        and $1 \le \ell \le m-2$.

Case-5. $z'$ is $x$, implying $z = \lfloor x \rfloor xy$ and $y \in B(n-1, m-2)$.

Case-6. Either $z'$ starts with $x$ but does not end with $x$, implying
        $z = \lfloor xv \rfloor xy$ for some $(v,y) \in B(k,\ell) \times B(n-1-k, m-2-\ell)$
        where $1 \le \ell \le m-3$; or
        $z'$ does not start with $x$ but ends with $x$, implying
        $z = \lfloor vx \rfloor xy$ with $(v,y) \in B(k,\ell) \times B(n-1-k, m-2-\ell)$
        where $1 \le \ell \le m-3$;

Case-7. $z'$ starts with $x$ and ends with $x$ (but is not $x$), implying
        $z = \lfloor xvx \rfloor xy$ with $(v,y) \in B(k,\ell) \times B(n-1-k, m-3-\ell)$
        where $1 \le \ell \le m-4$.

Note that Cases 4-7 correspond respectively to Steps 4-7 of the algorithm and
generate disjoint subsets of $D(n,m)$. This shows that $D(n,m)$ is precisely the
set of RBWs generated by these steps and hence

\[
\begin{aligned}
d_{n,m} = {} & \sum_{k=1}^{n-2}\sum_{\ell=1}^{m-2} d_{k,\ell}\, b_{n-1-k,\,m-1-\ell} + b_{n-1,m-2} \\
& + 2\sum_{k=1}^{n-2}\sum_{\ell=1}^{m-3} b_{k,\ell}\, b_{n-1-k,\,m-2-\ell}
+ \sum_{k=1}^{n-2}\sum_{\ell=1}^{m-4} b_{k,\ell}\, b_{n-1-k,\,m-3-\ell}.
\end{aligned}
\tag{36}
\]

Combining equations (35) and (36) finishes the proof (and provides a second,
constructive, proof of Eq. (34)). □
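
The steps of the algorithm in Theorem 3.4 can be sketched in Python as follows. This is an illustrative sketch, not the authors' implementation. Words are encoded as strings over `x`, `[` and `]` (standing for $x$, $\lfloor$ and $\rfloor$), and the names `rbw_sets` and `b_closed` are hypothetical. As a simplifying assumption, the recursion is applied for all $n \ge 2$ with $B(1,1) = \{\lfloor x \rfloor\}$ as the only base case, instead of pre-generating the sets of degree at most 3 as in Step 0. The resulting counts are compared with the closed form $\binom{n-1}{m-n} C_n$.

```python
from functools import lru_cache
from math import comb


@lru_cache(maxsize=None)
def rbw_sets(n, m):
    """Return (I(n,m), D(n,m)) as tuples of strings; B(n,m) is their union."""
    if n < 1 or not (n <= m <= 2 * n - 1):   # Step 0: empty outside the range
        return (), ()
    if n == 1:                               # assumed base case: B(1,1) = {[x]}
        return ("[x]",), ()

    def bracketed(k, l):
        ind, dec = rbw_sets(k, l)
        return ind + dec

    indec, decomp = [], []
    for w in bracketed(n - 1, m - 1):        # Step 1
        indec += ["[x" + w + "]", "[" + w + "x]"]
    for u in bracketed(n - 1, m - 2):
        indec.append("[x" + u + "x]")        # Step 2
        decomp.append("[x]x" + u)            # Step 5
    for k in range(1, n - 1):
        for l in range(1, m - 1):
            ind_v, dec_v = rbw_sets(k, l)
            for y in bracketed(n - 1 - k, m - 1 - l):
                indec += ["[" + v + "x" + y + "]" for v in ind_v]   # Step 3
                decomp += ["[" + v + "]x" + y for v in dec_v]       # Step 4
            for v in ind_v + dec_v:
                for y in bracketed(n - 1 - k, m - 2 - l):           # Step 6
                    decomp += ["[x" + v + "]x" + y, "[" + v + "x]x" + y]
                for y in bracketed(n - 1 - k, m - 3 - l):           # Step 7
                    decomp.append("[x" + v + "x]x" + y)
    return tuple(indec), tuple(decomp)


def b_closed(n, m):
    """Closed form for |B(n,m)|: binom(n-1, m-n) * C_n, or 0 out of range."""
    if not (1 <= n <= m <= 2 * n - 1):
        return 0
    return comb(n - 1, m - n) * comb(2 * n, n) // (n + 1)


if __name__ == "__main__":
    ind, dec = rbw_sets(3, 4)
    print(sorted(ind + dec))
```

Memoizing on $(n,m)$ means each set $B(k,\ell)$ is built once, which mirrors the irredundant generation the theorem describes.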
4 One generator and one operator: arbitrary exponent case



We now consider the more general cases. In this section, we generalize previous
results to the cases of one generator $x$ and one operator $P = \lfloor\ \rfloor$ without
requiring these to be idempotent. Referring to Example 2.6, we have again
$q = 1$, but now $v = v_1$ is arbitrary (including $v_1 = \infty$). Moreover, we also
restrict the set of Rota-Baxter words $\mathfrak{M}^{0}(1,(v))$ to those where the number
of consecutive applications of the operator $P$ is bounded by a given $u$ (which
may also be $\infty$, in which case there will be no restriction at all).



4.1 Notations

For this section, we now introduce new terminology and notations. For any
RBW $w$, and operator $P = \lfloor\ \rfloor$ occurring in $w$, a $P$-run is any occurrence in $w$
of consecutive compositions of $\lfloor\ \rfloor$ of maximal length (that is, of immediately
nested $\lfloor\ \rfloor$, where the length is the number of consecutive applications of $P$).
Recall from Definition 2.5 that, for any generator $x$, an $x$-run is any occurrence in
$w$ of consecutive (algebraic) products of $x$ of maximal length. We denote a
$P$-run by $P^{(\mu)}$ or $\lfloor\ \rfloor^{(\mu)}$ if $\mu$ is its run length, and an $x$-run by $x^{\nu}$ if $\nu$ is its run
length. When $\mu$ or $\nu$ is 1, we shall often omit the superscript. Let $u, v$ be either
positive integers or $\infty$ and let $R_{u,v}$ be the subset of RBWs (including $\emptyset$) where
the length of $P$-runs is $\le u$ and the length of $x$-runs is $\le v$. These subsets
are potential canonical bases of Rota-Baxter algebras on one generator. We
have seen in Section 2 that $R_{\infty,v}$ is the canonical basis of the free Rota-Baxter
algebra Ш$^{NC,0}(1,(v))$ (Example 2.6). Also $R_{1,1}$ is the canonical basis of the
free Rota-Baxter algebra with one idempotent generator and one idempotent
operator considered in Section 3 (see also Aguiar and Moreira (2005)).



For convenience, we say the operator $P = \lfloor\ \rfloor$ has exponent $u$ and the
generator $x$ has exponent $v$ if we are enumerating the set $R_{u,v}$. This would be
the case for Rota-Baxter algebras where the generator $x$ satisfies $x^{v+1} = x$
and the operator $P$ satisfies $P^{(u+1)}(w) = P(w)$ for any $w$. In this section, our
enumeration on RBWs is valid for any unary operator $P$. It is not clear under
what conditions a Rota-Baxter operator $P$ would have exponent $u$ for $u \ge 2$.

For $n \ge 1$, let $R_{u,v}(n)$ be the subset of $R_{u,v}$ consisting of all RBWs of degree
$n$, and for $m \ge 1$, let $R_{u,v}(n,m)$ be the subset of $R_{u,v}$ consisting of RBWs with
degree $n$ and arity $m$. Moreover, for $1 \le k \le n$, we let $R_{u,v}(n,m;k)$ be the
subset of $R_{u,v}(n,m)$ consisting of RBWs where the $n$ pairs of balanced brackets
are distributed into exactly $k$ $P$-runs, and for $1 \le \ell \le m$, we let $R_{u,v}(n,m;k,\ell)$
be the subset of $R_{u,v}(n,m)$ consisting of RBWs where the $n$ pairs of balanced
brackets are distributed into exactly $k$ $P$-runs, and the $m$ $x$'s are distributed
into exactly $\ell$ $x$-runs. Except for $R_{u,v}(n)$, these subsets are all finite sets, even
when $u, v$ are infinite, and we shall denote their corresponding cardinalities
by replacing $R$ by the lower case $r$. Thus, for example, $r_{u,v}(n,m;k,\ell)$ is the
cardinality of $R_{u,v}(n,m;k,\ell)$ and the count $r_{n,m}$ of Section 3 is now denoted
by $r_{1,1}(n,m)$. This convention will be used for all other (finite) sets of RBWs
we may introduce later.

As an example for the above terms and notations, the RBW

\[ w = x^2 \lfloor x \lfloor x^3 \rfloor^{(2)} x^2 \rfloor = xx \lfloor x \lfloor\lfloor xxx \rfloor\rfloor xx \rfloor \]

is an element in $R_{u,v}(3,8;2,4)$ for any $u \ge 2$, $v \ge 3$ since the 3 pairs of balanced
brackets occur in 2 $P$-runs of run-lengths 1 and 2, and the 8 $x$'s occur in 4
$x$-runs of run-lengths 2, 1, 3, 2.

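We also define the following generating series:

\[ \mathbf{R}_{u,v}(z) = \sum_{n=0}^{\infty} r_{u,v}(n)\, z^n \tag{37} \]

\[ \mathbf{R}_{u,v}(z,t) = \sum_{n=0}^{\infty}\sum_{m=0}^{\infty} r_{u,v}(n,m)\, z^n t^m \tag{38} \]

\[ \mathbf{R}_{u,v}(z,t,\zeta) = \sum_{n=0}^{\infty}\sum_{m=0}^{\infty}\sum_{k=0}^{\infty} r_{u,v}(n,m;k)\, z^n t^m \zeta^k \tag{39} \]

\[ \mathbf{R}_{u,v}(z,t,\zeta,\xi) = \sum_{n=0}^{\infty}\sum_{m=0}^{\infty}\sum_{k=0}^{\infty}\sum_{\ell=0}^{\infty} r_{u,v}(n,m;k,\ell)\, z^n t^m \zeta^k \xi^{\ell} \tag{40} \]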

We divide our study into three parts. In Section 4.2, we recall some results 
on compositions. In Section 4.3, we develop enumeration formulae relating 
the general cases to the idempotent case. In the remaining Section 4.4, we 
compute the generating functions, and sketch an algorithm for generating the 
sets Ru,v{n, m). 
4.2 Compositions of an Integer



We recall (see, for example, MacMahon (1984)) a well-known result on
compositions (also called ordered partitions) of a positive integer $m$. Let $G(m,\ell,v)$
be the set of compositions of the integer $m$ into $\ell$ positive integer parts, with
each part at most $v$, and let $g(m,\ell,v)$ be the size of this set. When $v$ is finite,
it is easy to see from the multinomial expansion of

\[ (t + t^2 + \cdots + t^v)^{\ell} \]

by collecting the coefficients of $t^m$ that

\[
g(m,\ell,v) = \sum_{\substack{\ell_1 + \cdots + \ell_v = \ell \\ \ell_1 + 2\ell_2 + \cdots + v\ell_v = m}} \binom{\ell}{\ell_1, \ldots, \ell_v}
\]

where $\ell_j$ is the number of times $t^j$ is chosen among the $\ell$ factors of the
product. It also follows that

\[ \sum_{m=1}^{\infty} g(m,\ell,v)\, t^m = (t + t^2 + \cdots + t^v)^{\ell}. \]

Thus, for given $\ell$ and $v$, we find that the left-hand side is the generating
function $\mathbf{G}_{\ell,v}(t)$ for $g$ with respect to $m$, namely:

\[ \mathbf{G}_{\ell,v}(t) := t^{\ell} \Bigl( \frac{1 - t^v}{1 - t} \Bigr)^{\ell}. \tag{41} \]

In a similar but simpler way, by expanding $(t + t^2 + \cdots)^{\ell}$ when $v = \infty$, we see
that

\[ g(m,\ell,\infty) = \binom{m-1}{\ell-1}, \tag{42} \]

which is the number of compositions of $m$ into $\ell$ parts, with no restrictions on
the size of each part. We have an associated generating function

\[
\mathbf{G}_{\ell,\infty}(t) := \sum_{m=1}^{\infty} g(m,\ell,\infty)\, t^m
= \sum_{m=\ell}^{\infty} \binom{m-1}{\ell-1} t^m = \Bigl( \frac{t}{1-t} \Bigr)^{\ell}. \tag{43}
\]

At the other extreme case, when $v = 1$, then $g(m,\ell,1) = \delta_{m,\ell}$ (Kronecker's $\delta$)
with $\mathbf{G}_{\ell,1}(t) = t^{\ell}$.

Regarding the power series ring $\mathbb{Z}[[t]]$ over the ring $\mathbb{Z}$ of integers as the
completion of the polynomial ring $\mathbb{Z}[t]$ in the usual sense, we have

\[ \mathbf{G}_{\ell,\infty}(t) = \lim_{v \to \infty} \mathbf{G}_{\ell,v}(t). \]

This implies that

\[ g(m,\ell,\infty) = \lim_{v \to \infty} g(m,\ell,v). \]

In fact, $g(m,\ell,\infty) = g(m,\ell,v)$ for all $v \ge m$.
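
The sets $G(m,\ell,v)$ are easy to enumerate directly. The following Python sketch is an illustration only; the names `compositions` and `g` and the convention that `bound=None` stands for $v = \infty$ are assumptions made here. It generates the compositions recursively and counts them, so that the closed form $\binom{m-1}{\ell-1}$ for $v = \infty$ and the Kronecker delta for $v = 1$ can be checked on small cases.

```python
from math import comb


def compositions(m, parts, bound=None):
    """Yield the compositions of m into `parts` positive parts, each <= bound.

    bound=None means the parts are unrestricted (the case v = infinity).
    """
    if parts == 0:
        if m == 0:
            yield ()
        return
    top = m if bound is None else min(m, bound)
    for first in range(1, top + 1):
        for rest in compositions(m - first, parts - 1, bound):
            yield (first,) + rest


def g(m, parts, bound=None):
    """Size of G(m, parts, bound), obtained by direct enumeration."""
    return sum(1 for _ in compositions(m, parts, bound))


if __name__ == "__main__":
    print(list(compositions(4, 2, 3)))   # compositions of 4 into 2 parts <= 3
    print(g(6, 3), comb(5, 2))           # unrestricted count vs binomial
```

Bounding the first part inside the recursion avoids generating unrestricted compositions and discarding the violating ones afterwards.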

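Lemma 4.1 For any power series $F(t) = \sum_{\ell=1}^{\infty} a_{\ell} t^{\ell}$ and $1 \le v \le \infty$, we have

\[ F(\mathbf{G}_{1,v}(t)) = \sum_{m=1}^{\infty} \Bigl( \sum_{\ell=1}^{m} a_{\ell}\, g(m,\ell,v) \Bigr) t^m. \]

Proof: Since $\mathbf{G}_{1,v}(t)^{\ell} = \mathbf{G}_{\ell,v}(t)$, we have

\[
F(\mathbf{G}_{1,v}(t)) = \sum_{\ell \ge 1} a_{\ell}\, \mathbf{G}_{\ell,v}(t)
= \sum_{\ell \ge 1} \sum_{m=1}^{\infty} a_{\ell}\, g(m,\ell,v)\, t^m
= \sum_{m=1}^{\infty} \Bigl( \sum_{\ell=1}^{m} a_{\ell}\, g(m,\ell,v) \Bigr) t^m
\]

since $g(m,\ell,v) = 0$ if $\ell > m$. □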
4.3 Enumeration



Recall that in the case when the Rota-Baxter operator $\lfloor\ \rfloor$ and the generator $x$
are both idempotent (that is, each of exponent 1), we had studied in Section 3
the set $R_{1,1}(n) = R(n)$, which is the set of all RBWs of degree $n$, and the set
$R_{1,1}(n,m) = R(n,m)$, which is the set of all RBWs of degree $n$ and arity $m$. It
was shown in Eq. (28) that the doubly indexed sequence $r_{n,m} = r_{1,1}(n,m)$ has
the generating function:

\[
\mathbf{R}_{1,1}(z,t) := \mathbf{R}(z,t) = \sum_{n,m \ge 0} r_{1,1}(n,m)\, z^n t^m
= \frac{1 - \sqrt{1 - 4zt - 4zt^2}}{2zt}.
\]

We have the disjoint union:

\[ R_{u,v}(n,m) = \coprod_{k=1}^{n} \coprod_{\ell=1}^{m} R_{u,v}(n,m;k,\ell). \tag{44} \]

We note that in order for $R_{1,1}(n,m;k,\ell)$ to be non-empty, we must have
$n = k$ and $m = \ell$, in which case we have $R_{1,1}(n,m;k,\ell) = R_{1,1}(k,\ell)$. For any
$w \in R_{1,1}(k,\ell)$, $w$ has exactly $k$ pairs of balanced brackets, no two of which
are immediately nested, and $w$ has exactly $\ell$ $x$'s, no two of which are next to
each other. We define below what might be called a "collapsing map":

\[ \Psi_{u,v,n,m} : R_{u,v}(n,m;k,\ell) \longrightarrow R_{1,1}(k,\ell). \tag{45} \]

Given $w \in R_{u,v}(n,m;k,\ell)$, we define $\Psi_{u,v,n,m}(w)$ to be the RBW $w_1 \in R_{1,1}(k,\ell)$
obtained by replacing each of the $k$ $P$-runs that appears in $w$ by a single
$P$ and each of the $x$-runs appearing in $w$ by a single $x$. This map is clearly
surjective for each pair $(k,\ell)$. We shall refer to $w_1$ as the collapse of $w$. The
map $\Psi_{u,v,n,m}$ technically depends on $k, \ell$, but for simplicity, we will omit that
in the notation. Moreover, by Eq. (44), we may by abuse of language extend
all these maps uniquely to a single one:

\[ \Psi_{u,v,n,m} : R_{u,v}(n,m) \longrightarrow \coprod_{k=1}^{n} \coprod_{\ell=1}^{m} R_{1,1}(k,\ell). \]

For each RBW $w \in R_{u,v}(n,m;k,\ell)$, $w$ has exactly $k$ $P$-runs and exactly $\ell$
$x$-runs. For $1 \le i \le k$, let $n_i$ be the run-length of the $i$-th $P$-run of $w$ and for
$1 \le j \le \ell$, let $m_j$ be the run-length of the $j$-th $x$-run of $w$. We have $1 \le n_i \le u$
(since $P$ has exponent $u$) and $1 \le m_j \le v$ (since $x$ has exponent $v$). Then
to each $w$ there corresponds the triplet $(w_1, \vec{n}, \vec{m})$ where $w_1 = \Psi_{u,v,n,m}(w) \in
R_{1,1}(k,\ell)$, a composition $\vec{n} = (n_1, \ldots, n_k)$ of $n$ into $k$ parts with each part $n_i$
no bigger than $u$, and a composition $\vec{m} = (m_1, \ldots, m_{\ell})$ of $m$ into $\ell$ parts with
no part $m_j$ bigger than $v$. Conversely, every such triplet determines a unique
$w \in R_{u,v}(n,m;k,\ell)$ such that $\Psi_{u,v,n,m}(w) = w_1$. Using the sets $G(m,\ell,v)$ for
compositions defined in Section 4.2, we have a bijection:

\[ R_{u,v}(n,m;k,\ell) \longleftrightarrow R_{1,1}(k,\ell) \times G(n,k,u) \times G(m,\ell,v). \tag{46} \]
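
The correspondence $w \mapsto (w_1, \vec{n}, \vec{m})$ can be illustrated with a short Python sketch. It is not part of the paper: words are encoded as strings over `x`, `[` and `]`, the function name `collapse` is hypothetical, and $P$-runs are assumed to be ordered by the position of their outermost opening bracket. A bracket continues the current $P$-run when it opens immediately after another opening bracket and closes immediately before that bracket's closing partner.

```python
import re


def collapse(word):
    """Return (collapsed word, P-run lengths, x-run lengths) for an RBW string."""
    # x-runs: record their lengths, then shrink each run to a single x
    x_runs = tuple(len(run) for run in re.findall(r"x+", word))
    word = re.sub(r"x+", "x", word)

    # match every opening bracket with its closing bracket
    stack, match = [], {}
    for pos, ch in enumerate(word):
        if ch == "[":
            stack.append(pos)
        elif ch == "]":
            match[stack.pop()] = pos

    p_runs, drop = [], set()
    for pos in sorted(match):
        nested = (
            pos > 0
            and word[pos - 1] == "["
            and match[pos - 1] == match[pos] + 1
        )
        if nested:                       # immediately nested: same P-run
            p_runs[-1] += 1
            drop.update((pos, match[pos]))
        else:                            # start of a new P-run
            p_runs.append(1)

    collapsed = "".join(ch for i, ch in enumerate(word) if i not in drop)
    return collapsed, tuple(p_runs), x_runs


if __name__ == "__main__":
    print(collapse("xx[x[[xxx]]xx]"))
```

For the example word $xx\lfloor x\lfloor\lfloor xxx\rfloor\rfloor xx\rfloor$ of Section 4.1, the sketch returns the collapse $x\lfloor x\lfloor x\rfloor x\rfloor$ together with the compositions $(1,2)$ of 3 and $(2,1,3,2)$ of 8.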

Theorem 4.2 With the notations established in Sections 4.1, 4.2 and above,
the numbers $r_{u,v}(n,m)$ of RBWs of degree $n$ and arity $m$ in the set $R_{u,v}$ of RBWs
with one generator $x$ of exponent $v$ and one operator $P$ of exponent $u$ are given
by:

(1) For $1 \le u \le \infty$, $r_{u,1}(n,m) = \sum_{k=1}^{n} g(n,k,u)\, r_{1,1}(k,m)$;

(2) For $1 \le v \le \infty$, $r_{1,v}(n,m) = \sum_{\ell=1}^{m} g(m,\ell,v)\, r_{1,1}(n,\ell)$;

(3) For $1 \le u,v \le \infty$, $r_{u,v}(n,m)$ is given by any of the three expressions:

    (a) $\sum_{\ell=1}^{m} g(m,\ell,v)\, r_{u,1}(n,\ell)$,

    (b) $\sum_{k=1}^{n} g(n,k,u)\, r_{1,v}(k,m)$,

    (c) $\sum_{k=1}^{n} \sum_{\ell=1}^{m} g(n,k,u)\, g(m,\ell,v)\, r_{1,1}(k,\ell)$.

Proof: The proofs are all similar. For example, 3(c) follows from the bijection
in Eq. (46) and the disjoint union in Eq. (44). □
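
Expression 3(c) translates directly into a small computation. The sketch below is an illustration under stated assumptions: it takes $r_{1,1}(k,\ell) = \binom{k+1}{\ell-k} C_k$ from Eq. (30), counts compositions recursively, uses `None` for an infinite exponent, and the names `r11`, `g` and `r_uv` are hypothetical.

```python
from functools import lru_cache
from math import comb


def r11(n, m):
    """Idempotent count r_{1,1}(n, m) from Eq. (30)."""
    if not (0 <= n <= m <= 2 * n + 1):
        return 0
    return comb(n + 1, m - n) * comb(2 * n, n) // (n + 1)


@lru_cache(maxsize=None)
def g(m, parts, bound=None):
    """Number of compositions of m into `parts` parts, each <= bound."""
    if parts == 0:
        return 1 if m == 0 else 0
    top = m if bound is None else min(m, bound)
    return sum(g(m - first, parts - 1, bound) for first in range(1, top + 1))


def r_uv(n, m, u=None, v=None):
    """r_{u,v}(n, m) for n, m >= 1 via Theorem 4.2, expression 3(c)."""
    return sum(
        g(n, k, u) * g(m, l, v) * r11(k, l)
        for k in range(1, n + 1)
        for l in range(1, m + 1)
    )


if __name__ == "__main__":
    print(r_uv(1, 2, 1, 2))   # degree 1, arity 2, x-runs of length at most 2
```

As a small check, $r_{1,2}(1,2) = 3$ corresponds to the words $\lfloor xx\rfloor$, $x\lfloor x\rfloor$ and $\lfloor x\rfloor x$, and $r_{\infty,\infty}(2,2) = 5$ corresponds to $\lfloor\lfloor xx\rfloor\rfloor$, $x\lfloor\lfloor x\rfloor\rfloor$, $\lfloor\lfloor x\rfloor\rfloor x$, $\lfloor x\lfloor x\rfloor\rfloor$ and $\lfloor\lfloor x\rfloor x\rfloor$.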

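Corollary 4.3 Suppose one or both of $u$ and $v$ are $\infty$. Then we have

\[ r_{\infty,1}(n,m) = \sum_{k=1}^{n} \binom{n-1}{k-1} \binom{k+1}{m-k} C_k, \]

\[ r_{1,\infty}(n,m) = \sum_{\ell=1}^{m} \binom{m-1}{\ell-1} \binom{n+1}{\ell-n} C_n, \]

\[ r_{\infty,\infty}(n,m) = \sum_{k=1}^{n} \sum_{\ell=1}^{m} \binom{n-1}{k-1} \binom{m-1}{\ell-1} \binom{k+1}{\ell-k} C_k. \]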

4.4 Generating Functions and Enumeration Algorithm

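We now find the generating functions $\mathbf{R}_{u,v}(z,t)$ of the number sequences
$r_{u,v}(n,m)$ for $1 \le u, v \le \infty$ (see Eqs. (38)-(40) for definitions and notations).
Recall from Eqs. (28)-(29), we have

\[ \mathbf{R}_{1,1}(z,t) = \mathbf{R}(z,t) = \frac{1 - \sqrt{1 - 4zt(1+t)}}{2zt} = (1+t)\, C\bigl(zt(1+t)\bigr) \]

where

\[ C(z) = \sum_{n=0}^{\infty} C_n z^n = \frac{1 - \sqrt{1 - 4z}}{2z} \]

is the generating function of the Catalan numbers.

Theorem 4.4 Let $1 \le u,v \le \infty$. The generating function $\mathbf{R}_{u,v}(z,t)$ for the
number $r_{u,v}(n,m)$ of RBWs in a Rota-Baxter algebra with one operator $P$ with
exponent $u$ and one generator $x$ with exponent $v$ is given by:

(1) $\mathbf{R}_{u,1}(z,t) = \mathbf{R}_{1,1}(\mathbf{G}_{1,u}(z),\, t)$.

(2) $\mathbf{R}_{1,v}(z,t) = \mathbf{R}_{1,1}(z,\, \mathbf{G}_{1,v}(t))$.

(3) $\mathbf{R}_{u,v}(z,t) = \mathbf{R}_{1,1}(\mathbf{G}_{1,u}(z),\, \mathbf{G}_{1,v}(t))$.

where $\mathbf{G}_{1,u}$ and $\mathbf{G}_{1,v}$ are given by Eq. (41) (for finite $u, v$) and by Eq. (43)
(for infinite $u, v$).

Proof: The proofs follow from Theorem 4.2 and Lemma 4.1. We just prove
case (2):

\[
\begin{aligned}
\mathbf{R}_{1,v}(z,t) &= \sum_{n,m \ge 1} r_{1,v}(n,m)\, z^n t^m \\
&= \sum_{n,m \ge 1} \sum_{\ell=1}^{m} g(m,\ell,v)\, r_{1,1}(n,\ell)\, z^n t^m \\
&= \sum_{m=1}^{\infty} \Bigl( \sum_{\ell=1}^{m} g(m,\ell,v) \Bigl[ \sum_{n=1}^{\infty} r_{1,1}(n,\ell)\, z^n \Bigr] \Bigr) t^m \\
&= \mathbf{R}_{1,1}(z,\, \mathbf{G}_{1,v}(t)). \qquad \Box
\end{aligned}
\]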

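Corollary 4.5 Suppose one or both of $u, v$ are $\infty$. Then the generating
function for $r_{u,v}(n,m)$ is given by

\[ \mathbf{R}_{\infty,1}(z,t) = \mathbf{R}_{1,1}\Bigl( \frac{z}{1-z},\, t \Bigr), \]

\[ \mathbf{R}_{1,\infty}(z,t) = \mathbf{R}_{1,1}\Bigl( z,\, \frac{t}{1-t} \Bigr), \]

\[ \mathbf{R}_{\infty,\infty}(z,t) = \mathbf{R}_{1,1}\Bigl( \frac{z}{1-z},\, \frac{t}{1-t} \Bigr). \]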

We end this section with a brief description of an algorithm to enumerate
the sets $R_{u,v}(n,m)$. The details will be left out since, by means of the disjoint
union in Eq. (44) and the bijection in Eq. (46), this is fairly straightforward.
We already have an algorithm (see Theorem 3.4) for the enumeration
of $R_{1,1}(k,\ell)$ for any positive $k, \ell$. We need an algorithm to generate all the
compositions of $m$ in $G(m,\ell,v)$, which would of course generate $G(n,k,u)$,
too. Now the set of compositions $\vec{m} = (m_1, \ldots, m_{\ell})$ of $m$ into exactly $\ell$ parts
without restrictions on the parts can be enumerated by readily available,
efficient, and well-known algorithms (see COMP_NEXT of the SUBSET library in
Nijenhuis and Wilf (1978), for example). Those compositions whose parts
violate the restrictions $m_j \le v$ can be easily discarded by modifying the code.



5 The general case: multiple generators and operators 



In this section, we consider sets of RBWs with $p$ unary operators $P_1, \ldots, P_p$
and $q$ generators $x_1, \ldots, x_q$. Since our purpose here is enumeration, we will for
now consider formal bracketed words with brackets $\lfloor_i\ \rfloor_i$ corresponding to $P_i$
($1 \le i \le p$) and ignore any other properties of the operators.

While it is possible to give a definition generalizing that of Definition 2.1, it 
is more straight-forward to generalize the grammar of Section 3. Since we do 
not need such details, we leave the specification of the grammar to the reader. 

We adopt the convention that a vector quantity using the same symbol as the
corresponding scalar quantity will have components with the same symbol
but subscripted. For example, $\vec{P} = (P_1, \ldots, P_p)$ and $\vec{x} = (x_1, \ldots, x_q)$. The
$P$-exponent vector $\vec{u} = (u_1, \ldots, u_p)$ will mean that the operator $P_i$ has exponent
$u_i$, $1 \le i \le p$, and this means that we only consider RBWs in which the number
of consecutive applications of the operator $P_i$ is bounded by $u_i$ for each $i$.
We define $P_i$-runs (resp. $x_j$-runs) similarly to $P$-runs (resp. $x$-runs), treating
each $P_i$ (resp. $x_j$) as a single operator (resp. generator). We shall call a run of
$P$'s (with whatever subscripts) a $P$-run, and similarly define an $x$-run. Sets of
RBWs are defined also with the parameters vectorized. For a numerical vector
such as $\vec{n}$, we let $|\vec{n}|$ denote its norm, which is the sum of its components.
Thus, without further explanation, $|\vec{m}|$ will be the total $x$-arity of a RBW $w$
whose $x$-arity vector is $\vec{m}$. As an example, when $p = q = 2$, the RBW

\[
\begin{aligned}
w &= x_1^3 x_2^4\, P_1 P_2^{(3)}\bigl( x_1 x_2 P_1(x_1) \bigr) \\
  &= x_1 x_1 x_1 x_2 x_2 x_2 x_2 \lfloor_1 \lfloor_2 \lfloor_2 \lfloor_2 x_1 x_2 \lfloor_1 x_1 \rfloor_1 \rfloor_2 \rfloor_2 \rfloor_2 \rfloor_1
\end{aligned}
\tag{47}
\]

has three $x_1$-runs of lengths 3, 1, and 1; two $x_2$-runs of lengths 4 and 1; three
$x$-runs of lengths 7, 2, and 1; two $P_1$-runs of lengths 1 and 1; one $P_2$-run of
length 3, and two $P$-runs of lengths 4 and 1. The $P$-degree vector of $w$ is (2,3),
its $P$-degree is 5, its $x$-arity vector is (5,5), and its $x$-arity is 10.

For any given positive integers $p$, $q$, and corresponding $P$-exponent vector $\vec{u}$,
$x$-exponent vector $\vec{v}$, let $R_{\vec{u},\vec{v}}$ denote the set of RBWs with $p$ operators and $q$
generators with corresponding exponent vectors $\vec{u}$ and $\vec{v}$ respectively. The values
$p$, $q$ are implicitly given by the dimensions of the vectors $\vec{u}$ and $\vec{v}$ respectively.
In particular $R_{\infty,\infty}$ is the set of all RBWs in the setting with one generator (of
exponent $\infty$) and one operator (also of exponent $\infty$). Thus $R_{\infty,\infty}$ is the
canonical basis for Ш$^{NC,0}(k[x])$ (see Example 2.2). Without further explanations,
the notations established in Section 4.1 will be generalized to their vectorized
versions in the obvious way. For example, $R_{\vec{u},\vec{v}}(n,m;k,\ell)$ will denote the set of
all RBWs of $R_{\vec{u},\vec{v}}$ with $P$-degree $n$ distributed into exactly $k$ $P$-runs, and $x$-arity
$m$ distributed into exactly $\ell$ $x$-runs. Furthermore, the cardinality of this set
is denoted by $r_{\vec{u},\vec{v}}(n,m;k,\ell)$ and, analogous to Eq. (44), we have the disjoint
union:

\[ R_{\vec{u},\vec{v}}(n,m) = \coprod_{k=1}^{n} \coprod_{\ell=1}^{m} R_{\vec{u},\vec{v}}(n,m;k,\ell). \tag{48} \]

5.1 The forgetful maps and coloring

We define another family of maps to relate the general case to the single
operator and single generator case we have solved. Basically, the forgetful maps
remove all the subscripts on the operators and generators. By composing the
forgetful maps with the collapsing maps, we further relate the general case
with the case when the single operator and single generator have exponent 1.
These maps capture the structure of runs of RBWs in the general case. We now
set up the notations to make these precise.

We define the "forgetful" maps:

\[ \Phi_{\vec{u},\vec{v},n,m} : R_{\vec{u},\vec{v}}(n,m;k,\ell) \longrightarrow R_{\infty,\infty}(n,m;k,\ell) \tag{49} \]

by defining $\Phi_{\vec{u},\vec{v},n,m}(w)$, for each $w \in R_{\vec{u},\vec{v}}(n,m;k,\ell)$, to be the RBW $w_{\infty}$ in
$R_{\infty,\infty}$ obtained when every $P_i = \lfloor_i\ \rfloor_i$, $1 \le i \le p$, that appears in $w$ is replaced by
$P = \lfloor\ \rfloor$ and every $x_j$, $1 \le j \le q$, that appears in $w$ is replaced by $x$. For
example, for the word $w$ in Eq. (47), we have

\[ \Phi_{\vec{u},\vec{v},n,m}(w) = x^7 P^{(4)}\bigl( x^2 P(x) \bigr) = x^7 \lfloor x^2 \lfloor x \rfloor \rfloor^{(4)}. \]

The map $\Phi_{\vec{u},\vec{v},n,m}$ is clearly surjective and depends on $p, q, \vec{u}, \vec{v}, n, m, k, \ell$, but
for notational brevity, we will not explicitly mention $p, q, k$, and $\ell$. Again, by
abuse of language and the disjoint unions in Eq. (48) and Eq. (44), we may
extend $\Phi_{\vec{u},\vec{v},n,m}$ uniquely to a surjection:

\[ \Phi_{\vec{u},\vec{v},n,m} : R_{\vec{u},\vec{v}}(n,m) \longrightarrow R_{\infty,\infty}(n,m). \]

We may further extend $\Phi_{\vec{u},\vec{v},n,m}$ to a surjection:

\[ \Phi_{\vec{u},\vec{v}} : R_{\vec{u},\vec{v}} \longrightarrow R_{\infty,\infty}. \]

It is convenient to refer to the RBW $w_{\infty} = \Phi_{\vec{u},\vec{v}}(w)$ as the monochrome image
of $w$ and say $w$ is a coloring of $w_{\infty}$. A special case of particular importance
is the following.

Consider an $x$-run $w$ which we may write in the form $x_{j_1}^{b_1} \cdots x_{j_\beta}^{b_\beta}$ where $\beta$ is
some positive integer and for $1 \le d \le \beta$, we have $1 \le j_d \le q$, $1 \le b_d \le v_{j_d}$,
and if $d < \beta$ then $j_d \ne j_{d+1}$. The monochrome image of $w$ will be $x^b$, where
$b = b_1 + \cdots + b_\beta$. Conversely, for any positive integer $b$, any coloring of $x^b$ is
an $x$-run $w$ of this form. Thus there is a bijection $\sigma = \sigma_{b,q,\vec{v}}$ between the set of
$x$-runs $w$ whose monochrome image is $x^b$ and the set $C(b,q,\vec{v})$ of colorings of $b$
identical objects in a row using up to $q$ colors, say colors $1, 2, \ldots, q$, repetitions
allowed, so that for $1 \le j \le q$, any run of color $j$ has length no longer than
$v_j$. Explicitly, given $w$, we define a coloring $\sigma(w) \in C(b,q,\vec{v})$ of the $b$ objects
using color $j_1$ for the first $b_1$ objects, then color $j_2$ for the next $b_2$ objects, etc.
The number of such colorings is denoted by $c(b,q,\vec{v})$.

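Similarly, for any positive integer $a$, we have a bijection $\pi = \pi_{a,p,\vec{u}}$ between
the set of $P$-runs whose monochrome image is $P^{(a)}$ and the set of colorings
$C(a,p,\vec{u})$. We can also apply these coloring maps to $x$-runs and $P$-runs
occurring in any RBW $w \in R_{\vec{u},\vec{v}}(n,m;k,\ell)$ whose monochrome image is a fixed
$w_{\infty} = \Phi_{\vec{u},\vec{v},n,m}(w)$. More explicitly, given $w_{\infty} \in R_{\infty,\infty}(n,m;k,\ell)$, let $(w_1, \vec{n}, \vec{m})$
be the triplet corresponding to $w_{\infty}$ according to the bijection from Eq. (46)
for the case $u = v = \infty$, so that $\vec{n} = (n_1, \ldots, n_k)$ is a composition of $n$ into $k$
parts, $\vec{m} = (m_1, \ldots, m_{\ell})$ is a composition of $m$ into $\ell$ parts, and, in particular,
$w_{\infty}$ collapses to $w_1 \in R_{1,1}(k,\ell)$ (that is, $\Psi_{\infty,\infty,n,m}(w_{\infty}) = w_1$). This implies
that for all $w \in (\Phi_{\vec{u},\vec{v}})^{-1}(w_{\infty}) = (\Phi_{\vec{u},\vec{v},n,m})^{-1}(w_{\infty})$, $w$ has exactly $k$ $P$-runs of
lengths $n_1, \ldots, n_k$, providing the colorings via the maps $\pi_{n_i,p,\vec{u}}$ for the
corresponding $P$-runs in the monochromatic image $w_{\infty}$, and similarly $w$ has exactly
$\ell$ $x$-runs of lengths $m_1, \ldots, m_{\ell}$, providing the colorings via the maps $\sigma_{m_j,q,\vec{v}}$
for the corresponding $x$-runs in $w_{\infty}$. This yields the bijection:

\[
(\Phi_{\vec{u},\vec{v},n,m})^{-1}(w_{\infty}) \longleftrightarrow
\prod_{i=1}^{k} C(n_i,p,\vec{u}) \times \prod_{j=1}^{\ell} C(m_j,q,\vec{v}).
\tag{50}
\]

By Eq. (46), as $w_{\infty}$ runs through the set $R_{\infty,\infty}(n,m;k,\ell)$, its triplet $(w_1, \vec{n}, \vec{m})$
runs through $R_{1,1}(k,\ell) \times G(n,k,\infty) \times G(m,\ell,\infty)$. Thus we have a bijection:

\[
\begin{aligned}
\rho : R_{\vec{u},\vec{v}}(n,m;k,\ell) &= (\Phi_{\vec{u},\vec{v},n,m})^{-1}\bigl( R_{\infty,\infty}(n,m;k,\ell) \bigr) \longleftrightarrow \\
& R_{1,1}(k,\ell) \times \bigcup_{\vec{n} \in G(n,k,\infty)} \Bigl( \prod_{i=1}^{k} C(n_i,p,\vec{u}) \Bigr)
\times \bigcup_{\vec{m} \in G(m,\ell,\infty)} \Bigl( \prod_{j=1}^{\ell} C(m_j,q,\vec{v}) \Bigr).
\end{aligned}
\tag{51}
\]

We gather below two results on coloring.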



Lemma 5.1 Let $\vec{v} = (\nu, \ldots, \nu)$ be a vector with $q$ identical coordinates where
$\nu$ is either an integer $\ge 1$ or $\infty$. Then

\[
c(b,q,\vec{v}) =
\begin{cases}
1 & \text{if } q = 1 \text{ and } b \le \nu; \\
0 & \text{if } q = 1 \text{ and } b > \nu; \\
\sum_{\beta=1}^{b} g(b,\beta,\nu)\, q (q-1)^{\beta-1} & \text{if } q \ge 2.
\end{cases}
\tag{52}
\]

If $\nu$ is infinite, then this simplifies to:

\[ c(b,q,\infty) = q^b. \tag{53} \]

Proof: The case when $q = 1$ is obvious. Suppose $q \ge 2$ and $\nu$ is finite (resp.
infinite). The number $\beta$ of same color runs in a coloring of $b$ objects can be at
most $b$. For each composition $b_1 + \cdots + b_\beta = b$ in $G(b,\beta,\nu)$, the run-lengths
$b_1, \ldots, b_\beta$, which are uniformly bounded by $\nu$ (resp. unbounded), are fixed
(therefore, the locations of the $\beta$ runs are also fixed). The uniform bound
(resp. unboundedness) allows any run to be assigned any color, except that
adjacent runs must have different colors. Thus there are $q(q-1)^{\beta-1}$ ways to choose
the colors. Any two such colorings with distinct parameters are distinct. This
proves the first statement.

Now suppose $\nu$ is infinite. Then we can color any of the $b$ objects with any of
the $q$ colors and there are $q^b$ ways. Alternatively, the second statement also
follows from the first using Eq. (42) and the Binomial Theorem. □
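
Lemma 5.1 is easy to test by brute force for small parameters. The following Python sketch is illustrative only; the function names are hypothetical and `nu=None` stands for $\nu = \infty$. It enumerates all color sequences of length $b$, keeps those whose monochromatic runs have length at most $\nu$, and compares the count with the sum in Eq. (52). In Python, `0 ** 0` evaluates to 1, so the same sum also reproduces the $q = 1$ cases.

```python
from itertools import groupby, product


def g(m, parts, bound=None):
    """Number of compositions of m into `parts` parts, each <= bound."""
    if parts == 0:
        return 1 if m == 0 else 0
    top = m if bound is None else min(m, bound)
    return sum(g(m - first, parts - 1, bound) for first in range(1, top + 1))


def colorings_brute(b, q, nu=None):
    """Count colorings of b objects in a row with q colors, runs <= nu."""
    count = 0
    for colors in product(range(q), repeat=b):
        runs = [len(list(group)) for _, group in groupby(colors)]
        if nu is None or max(runs) <= nu:
            count += 1
    return count


def colorings_formula(b, q, nu=None):
    """The sum in Eq. (52): choose the run lengths, then the colors."""
    return sum(
        g(b, beta, nu) * q * (q - 1) ** (beta - 1)
        for beta in range(1, b + 1)
    )


if __name__ == "__main__":
    print(colorings_brute(4, 2, 2), colorings_formula(4, 2, 2))
```

For example, with $b = 4$, $q = 2$ and $\nu = 2$, six of the sixteen binary sequences contain a run of length at least 3, leaving 10 colorings.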
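Corollary 5.2 Let $\vec{v} = (\nu, \ldots, \nu)$ be the same as in Lemma 5.1. Then the
generating series for $c(b,q,\vec{v})$ for fixed $q$ and $\vec{v}$ is given by

\[
\mathbf{C}_{q,\vec{v}}(t) =
\begin{cases}
\mathbf{G}_{1,\nu}(t) & \text{if } q = 1; \\[4pt]
\dfrac{q\, \mathbf{G}_{1,\nu}(t)}{1 - (q-1)\, \mathbf{G}_{1,\nu}(t)} & \text{if } q \ge 2.
\end{cases}
\]

If $\nu = \infty$, then this simplifies to

\[ \mathbf{C}_{q,\infty}(t) = \frac{qt}{1 - qt}. \]

Proof: The case $q = 1$ is clear. For $q \ge 2$, the generating function is

\[ \mathbf{C}_{q,\vec{v}}(t) = \sum_{b=1}^{\infty} \Bigl( \sum_{\beta=1}^{b} g(b,\beta,\nu)\, q (q-1)^{\beta-1} \Bigr) t^b \]

which, by Lemma 4.1 applied to

\[ F(t) = \sum_{\beta=1}^{\infty} q (q-1)^{\beta-1} t^{\beta} = \frac{qt}{1 - (q-1)t}, \]

is $F(\mathbf{G}_{1,\nu}(t))$, as required. The $\nu = \infty$ case follows from Eqs. (53) and (43). □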

We say the operators (resp. generators) have uniform exponents $\mu$ (resp. $\nu$)
if all the coordinates of the $P$-exponent (resp. $x$-exponent) vector $\vec{u}$ (resp. $\vec{v}$)
are equal to $\mu$ (resp. $\nu$), in which case we write $\vec{u} = \vec{\mu}$ (resp. $\vec{v} = \vec{\nu}$). In the
next two subsections, we consider the infinite exponent case, followed by the
finite uniform exponent case.

5.2 Uniform exponents: Infinite cases 

Theorem 5.3 Let $R_{\vec{u},\vec{v}}$ be the set of Rota-Baxter words with $p$ operators $\vec{P}$
and $q$ generators $\vec{x}$, with corresponding uniform exponent vectors $\vec{u}$ and $\vec{v}$
respectively. Then for any positive integers $n, m$, we have

(1) $r_{1,\vec{\infty}}(n,m) = q^m\, r_{1,\infty}(n,m)$,

(2) $r_{\vec{\infty},1}(n,m) = p^n\, r_{\infty,1}(n,m)$,

(3) $r_{\vec{\infty},\vec{\infty}}(n,m) = p^n q^m\, r_{\infty,\infty}(n,m)$,

where $r_{1,\infty}(n,m)$, $r_{\infty,1}(n,m)$, and $r_{\infty,\infty}(n,m)$ are given by Corollary 4.3.

Proof: We will prove only (3). Let $w_{\infty} \in R_{\infty,\infty}(n,m;k,\ell)$. Each element
$w \in R_{\vec{\infty},\vec{\infty}}(n,m;k,\ell)$ whose monochromatic image is $w_{\infty}$ is obtained from $w_{\infty}$
by coloring the $P$-runs and $x$-runs in $w_{\infty}$. For any positive integer $j$, let $N_j$ be
the set consisting of the first $j$ natural numbers. Since the number of mappings
from $N_n$ to $N_p$ is $p^n$ and the number of mappings from $N_m$ to $N_q$ is $q^m$, the
result follows. Alternatively, this theorem also follows by summing over all $k$
and $\ell$ the cardinalities from Eq. (51) and using Corollary 4.3. □

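Corollary 5.4 The generating functions for the double sequences $r_{1,\vec{\infty}}(n,m)$,
$r_{\vec{\infty},1}(n,m)$, and $r_{\vec{\infty},\vec{\infty}}(n,m)$ are, respectively,

(1) $\mathbf{R}_{1,\vec{\infty}}(z,t) = \mathbf{R}_{1,1}\Bigl( z,\, \dfrac{qt}{1-qt} \Bigr)$,

(2) $\mathbf{R}_{\vec{\infty},1}(z,t) = \mathbf{R}_{1,1}\Bigl( \dfrac{pz}{1-pz},\, t \Bigr)$, and

(3) $\mathbf{R}_{\vec{\infty},\vec{\infty}}(z,t) = \mathbf{R}_{1,1}\Bigl( \dfrac{pz}{1-pz},\, \dfrac{qt}{1-qt} \Bigr)$.

Proof: By the above theorem and Corollary 4.5, for case (3), we have

\[
\begin{aligned}
\mathbf{R}_{\vec{\infty},\vec{\infty}}(z,t)
&= \sum_{n=1}^{\infty} \sum_{m=1}^{\infty} p^n q^m\, r_{\infty,\infty}(n,m)\, z^n t^m \\
&= \sum_{n=1}^{\infty} \sum_{m=1}^{\infty} r_{\infty,\infty}(n,m)\, (pz)^n (qt)^m \\
&= \mathbf{R}_{1,1}\Bigl( \frac{pz}{1-pz},\, \frac{qt}{1-qt} \Bigr). \qquad \Box
\end{aligned}
\]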



5.3 General case 



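Theorem 5.5 Let $R_{\vec{u},\vec{v}}$ be the set of Rota-Baxter words with $p$ operators $\vec{P}$
having exponent vector $\vec{u}$, and $q$ generators $\vec{x}$ having exponent vector $\vec{v}$, where
$\vec{u}$ (resp. $\vec{v}$) may have finite or infinite components. Let $\mathbf{C}_{p,\vec{u}}(z)$ (resp. $\mathbf{C}_{q,\vec{v}}(t)$)
be the generating function for the sequence $c(a,p,\vec{u})$ as $a$ varies (resp. $c(b,q,\vec{v})$
as $b$ varies). Then the generating function for $r_{\vec{u},\vec{v}}(n,m)$ is

\[ \mathbf{R}_{\vec{u},\vec{v}}(z,t) = \mathbf{R}_{1,1}\bigl( \mathbf{C}_{p,\vec{u}}(z),\, \mathbf{C}_{q,\vec{v}}(t) \bigr). \]

Proof: The special cases when either $p = 1$, or $q = 1$, or both $p = q = 1$
specialize to corresponding cases of Theorem 4.4. Thus we shall assume $p \ge 2$
and $q \ge 2$. Some special cases when $\vec{u} = \vec{1}$ or $\vec{\infty}$ and when $\vec{v} = \vec{1}$ or $\vec{\infty}$ are
included in Corollaries 4.5 and 5.4.

Consider the forgetful map $\Phi_{\vec{u},\vec{v},n,m}$ of Eq. (49), which is surjective:

\[ \Phi_{\vec{u},\vec{v},n,m} : R_{\vec{u},\vec{v}}(n,m;k,\ell) \longrightarrow R_{\infty,\infty}(n,m;k,\ell). \]

We have obviously $R_{\vec{u},\vec{v}}(n,m;k,\ell) = (\Phi_{\vec{u},\vec{v},n,m})^{-1}(R_{\infty,\infty}(n,m;k,\ell))$, and using
the bijection from Eq. (51), we have

\[
r_{\vec{u},\vec{v}}(n,m;k,\ell) = r_{1,1}(k,\ell)
\Bigl( \sum_{\vec{n} \in G(n,k,\infty)} \prod_{i=1}^{k} c(n_i,p,\vec{u}) \Bigr)
\Bigl( \sum_{\vec{m} \in G(m,\ell,\infty)} \prod_{j=1}^{\ell} c(m_j,q,\vec{v}) \Bigr).
\]

Recalling Eq. (44), we now calculate the generating function.

\[
\mathbf{R}_{\vec{u},\vec{v}}(z,t) = \sum_{n,m,k,\ell} r_{1,1}(k,\ell)
\Bigl( \sum_{\vec{n} \in G(n,k,\infty)} \prod_{i=1}^{k} c(n_i,p,\vec{u}) \Bigr) z^n
\Bigl( \sum_{\vec{m} \in G(m,\ell,\infty)} \prod_{j=1}^{\ell} c(m_j,q,\vec{v}) \Bigr) t^m.
\]

Working on the last parenthesized expression, we have

\[
\begin{aligned}
\sum_{m=1}^{\infty} \Bigl( \sum_{\vec{m} \in G(m,\ell,\infty)} \prod_{j=1}^{\ell} c(m_j,q,\vec{v}) \Bigr) t^m
&= \sum_{m=1}^{\infty} \sum_{\vec{m} \in G(m,\ell,\infty)} \prod_{j=1}^{\ell} c(m_j,q,\vec{v})\, t^{m_j} \\
&= \Bigl( \sum_{b=1}^{\infty} c(b,q,\vec{v})\, t^b \Bigr)^{\ell} = \mathbf{C}_{q,\vec{v}}(t)^{\ell}.
\end{aligned}
\]

The result now follows from the definition of $\mathbf{R}_{1,1}(z,t)$. □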
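
The coefficient extraction behind this proof can be sketched numerically. The Python fragment below is an illustration under stated assumptions, not the authors' code. The coloring counts are passed in as functions `c_p` and `c_x` (hypothetical names), $r_{1,1}(k,\ell)$ is taken from the closed form $\binom{k+1}{\ell-k} C_k$, and `run_weight(n, k, c)` computes the coefficient of $z^n$ in $\bigl(\sum_a c(a) z^a\bigr)^k$ as a sum over compositions of $n$ into $k$ parts.

```python
from math import comb


def r11(n, m):
    """Idempotent count r_{1,1}(n, m) = binom(n+1, m-n) * C_n."""
    if not (0 <= n <= m <= 2 * n + 1):
        return 0
    return comb(n + 1, m - n) * comb(2 * n, n) // (n + 1)


def run_weight(n, k, c):
    """Sum over compositions (n_1, ..., n_k) of n of the product of c(n_i)."""
    if k == 0:
        return 1 if n == 0 else 0
    return sum(
        c(first) * run_weight(n - first, k - 1, c)
        for first in range(1, n + 1)
    )


def r_general(n, m, c_p, c_x):
    """Count of RBWs of P-degree n and x-arity m (n, m >= 1), given the
    coloring counts c_p(a) for P-runs and c_x(b) for x-runs."""
    return sum(
        r11(k, l) * run_weight(n, k, c_p) * run_weight(m, l, c_x)
        for k in range(1, n + 1)
        for l in range(1, m + 1)
    )


if __name__ == "__main__":
    p, q = 2, 3
    print(r_general(2, 2, lambda a: p ** a, lambda b: q ** b))
```

With $c(a) = p^a$ and $c(b) = q^b$ (all exponents infinite), the result equals $p^n q^m\, r_{\infty,\infty}(n,m)$; with $c(a) = \delta_{a,1}$ and $c(b) = \delta_{b,1}$ it reduces to $r_{1,1}(n,m)$.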
Corollary 5.6 If the exponent vectors $\vec{u}, \vec{v}$ are uniform, say $\vec{u} = \vec{\mu}$ and $\vec{v} = \vec{\nu}$
for some finite or infinite constants $\mu, \nu$, then the generating function for
$r_{\vec{u},\vec{v}}(n,m)$ is:

\[
\mathbf{R}_{\vec{\mu},\vec{\nu}}(z,t) = \mathbf{R}_{1,1}\Bigl(
\frac{p\, \mathbf{G}_{1,\mu}(z)}{1 - (p-1)\, \mathbf{G}_{1,\mu}(z)},\;
\frac{q\, \mathbf{G}_{1,\nu}(t)}{1 - (q-1)\, \mathbf{G}_{1,\nu}(t)} \Bigr).
\]

Proof: This follows from Lemma 5.1 and Corollary 5.2. □



6 Conclusion and outlook 



We have obtained generating functions and algorithms related to enumeration 
of sets of Rota-Baxter words in various generalities. It is interesting to see 
the close relation with Catalan numbers, further revealing the combinatorial 
nature of Rota-Baxter algebras in cases where the sets form canonical bases. 

The cases we have considered allow multiple operators and multiple generators
with uniform exponents. The case where the operators and generators are
allowed to have variable exponents is in principle solved, and it only remains
to compute the generating functions for the number of colorings. Since the
coloring problem is a constraint satisfaction problem, we are investigating this
connection to see if the result is already known. We expect these results to be
useful in the construction of more general Rota-Baxter type algebras, where
the unary operators may satisfy other identities instead.

We mentioned a few new integer sequences in this study. Clearly, since the 
most general generating functions given by Theorem 5.5 are parametrized by 
two arbitrary vectors, we expect many specialized sequences to be unknown 
in the Sloane database. The connections between sets of Rota-Baxter words 
and other sets of combinatorial objects when their counting sequences are the 
same will also be of tremendous interest in expanding the realm of symbolic 
computation because such connections will allow the representation of these 
other sets of combinatorial objects using Rota-Baxter words which are purely 
algebraic in nature. 